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Abstract. We provide a complete description of the Wadge hierarchy for deterministi- 
cally recognisable sets of infinite trees. In particular we give an elementary procedure to 
decide if one deterministic tree language is continuously reducible to another. This extends 
Wagner's results on the hierarchy of tj-regular languages of words to the case of trees. 



Two measures of complexity of recognisable languages of infinite words or trees have 
been considered in literature: the index hierarchy, which reflects the combinatorial com- 
plexity of the recognising automaton and is closely related to //-calculus, and the Wadge 
hierarchy, which is the refinement of the Borel/projective hierarchy that gives the deepest 
insight into the topological complexity of languages. Klaus Wagner was the first to discover 
remarkable relations between the two hierarchies for finite-state recognisable (w-regular) 
sets of infinite words [H]. Subsequently, decision procedures determining an w-regular 
language's position in both hierarchies were given [4l \7\ \TE\. 

For tree automata the index problem is only solved when the input is a deterministic 
automaton [ULI3]. As for topological complexity of recognisable tree languages, it goes much 
higher than that of w-regular languages, which are all A3. Indeed, co-Biichi automata over 
trees may recognise n|-complete languages [8], and Skurczyhski [12] proved that there are 
even weakly recognisable tree languages in every finite level of the Borel hierarchy. This 
may suggest that in the tree case the topological and combinatorial complexities diverge. 
On the other hand, the investigations of the Borel/projective hierarchy of deterministic 
languages [U [8] reveal some interesting connections with the index hierarchy. 

Wagner's results |14l I15j . giving rise to what is now called the Wagner hierarchy 
(see [10] ) , inspire the search for a complete picture of the two hierarchies and the relations 
between them for recognisable tree languages. In this paper we concentrate on the Wadge 
hierarchy of deterministic tree languages: we give a full description of the Wadge-equivalence 
classes forming the hierarchy, together with a procedure calculating the equivalence class 
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of a given deterministic language. In particular, we show that the hierarchy has the height 
_|_ <^ which should be compared with ui u for regular w-languages [15] . ui^ for deter- 
ministic context-free w-languages [1], (uP K ) u for ^-languages recognised by deterministic 
Turing machines [11] ; or an unknown ordinal £ > (w^p ) w for w-languages recognised by 
nondeterministic Turing machines, and the same ordinal £ for nondeterministic context-free 
languages [2]. 

The key notion of our argument is an adaptation of the Wadge game to tree languages, 
redefined entirely in terms of automata. Using this tool we construct a collection of canonical 
automata representing the Wadge degrees of all deterministic tree languages. Then we 
provide a procedure calculating the canonical form of a given deterministic automaton, 
which runs within the time of finding the productive states of the automaton (the exact 
complexity of this problem is unknown, but not worse than exponential). 

2. Automata 

We use the symbol uj to denote the set of natural numbers {0, 1, 2, . . .}. For an alphabet 
E, £* is the set of finite words over E and E w is the set of infinite words over E. The 
concatenation of words u E £* and v E £* U £ w will be denoted by uv, and the empty 
word by e. The concatenation is naturally generalised for infinite sequences of finite words 
viv 2 v 3 The concatenation of sets ACS*,BCE*uE u is AB = {uv : u E A,v E B}. 

A tree is any subset of uj* closed under the prefix relation. An element of a tree is usually 
called a node. A leaf is any node of a tree which is not a (strict) prefix of some other node. 
A Yj-labelled tree (or a tree over £) is a function t : domt — > £ such that domt is a tree. 
For v £ domt we define t.v as a subtree of t rooted in v, i. e., dom (t.v) = {u : vu E domt}, 
t.v(u) = t(vu). 

A full n-ary Ti-labeled tree is a function t : {0, 1, . . . , n — 1}* — ► £. The symbol Ts will 
denote the set of full binary trees over E. From now on, if not stated otherwise, a "tree" 
will mean a full binary tree over some alphabet. 

Out of a variety of acceptance conditions for automata on infinite structures, we choose 
the parity condition. A nondeterministic parity automaton on words can be presented as 
a tuple A = (E, Q, 5, qo, rank), where E is a finite input alphabet, Q is a finite set of states, 
<5CQx£xQis the transition relation, and qo E Q is the initial state. The meaning 
of the function rank : Q — > uj will be explained later. Instead of (q,a,q\) E 5 one usually 
writes q — — -> q\ . A run of an automaton A on a word w E E 1 ^ is a word p w E Q w such that 
Pw(0) = qo and if p w {n) = q, p w (n + 1) = qi, and w(n) = a, then g — > eft. A run is 
accepting if the highest rank repeating infinitely often in is even; otherwise p w is rejecting. 
A word is accepted by ^4 if there exists an accepting run on it. The language recognised by 
A, denoted L{A) is the set of words accepted by A. An automaton is deterministic if its 
relation of transition is a total function Q x E — ► Q. Note that a deterministic automaton 
has a unique run (accepting or not) on every word. We call a language deterministic if it is 
recognised by a deterministic automaton. 

A nondeterministic automaton on trees is a tuple A = (E, Q, 5, qo, rank), the only dif- 
ference being that KQxSxQxQ, Like before, q — * Qi,Q2 means (q, a, q\, q%) E 5. 

We write q q\ if there exists a state q2 such that q —* qi,q2- Similarly for q 52- 
A run of A on a tree t E Is is a tree p< E Tq such that p*(e) = go and if pt(v) = q, 
Pt(v0) = qi, pt(vl) = q2 and t(v) = a, then q — > qi,q2- A path it of the run p t is accepting 
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Figure 1: The Mostowskr-Rabin index hierarchy. 



if the highest rank repeating infinitely often in n is even; otherwise tt is rejecting. A run is 
called accepting if all its paths are accepting. If at least one of them is rejecting, so is the 
whole run. An automaton is called deterministic if its transition relation is a total function 
QxS^QxQ. 

By A q we denote the automaton A with the initial state set to q. A state q is all- 
accepting if A q accepts all trees, and all-rejecting if A q rejects all trees. A state (a transition) 
is called productive if it is used in some accepting run. Observe that being productive is 
more than just not being all-rejecting. A state q is productive if and only if it is not all- 

,■ 1,1 ■ ,1 O"0,rfo <?n,d n . CFi,di / y , , , / 

rejecting and there is a path qo — > q\ — > . . . — > q such that qi — > q\, di ^ di, and q i 
is not all-rejecting for i = 0, 1, . . . , n. 

Without loss of generality we may assume that all states in A are productive save 
for one all-rejecting state _L and that all transitions are either productive or are of the 
form q — > _L, _L. The reader should keep in mind that this assumption has influence on 
the complexity of our algorithms. Transforming a given automaton into such a form of 
course needs calculating the productive states, which is equivalent to deciding a language's 
emptiness. The latter problem is known to be in NP n co-NP and has no polynomial time 
solutions yet. Therefore, we can only claim that our algorithms are polynomial for the 
automata that underwent the above preprocessing. We will try to mention it whenever it 
is particularly important. 

The Mostowski-Rabin index of an automaton A is a pair 



An automaton with index (t, k) is often called a (t, K)-automaton. Scaling down the rank 
function if necessary, one may assume that minrankQ is either or 1. Thus, the indices 
are elements of {0,1} x u \ {(1,0)}. For an index (l,k) we shall denote by (l,k) the dual 
index, i. e., (0, k) = (1, K + 1), (1, k) = (0, k — 1). Let us define an ordering of indices with 
the following formula 



In other words, one index is smaller than another if and only if it uses less ranks. This 
means that dual indices are not comparable. The Mostowski-Rabin index hierarchy for 
a certain class of automata consists of ascending sets (levels) of languages recognised by 
(l, K)-automata (see Fig. [TJ). 

The fundamental question about the hierarchy is the strictness, i. e., the existence of 
languages recognised by a (l, K)-automaton, but not by a (t, k) -automaton. The strictness of 
the hierarchy for deterministic automata follows easily from the strictness of the hierarchy 
for deterministic word automata [15J: if a word language L needs at least the index (t, k), 
so does the language of trees that have a word from L on the leftmost branch. The index 
hierarchy for nondeterministic automata is also strict [6j. In fact, the languages showing 
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Figure 2: A (0, 2)-flower. 



the strictness may be chosen deterministic: one example is the family of the languages of 
trees over the alphabet {l, l + 1, . . . , k} satisfying the parity condition on each path. 

The second important question one may ask about the index hierarchy is how to deter- 
mine the exact position of a given language. This is known as the index problem. 

Given a deterministic language, one may ask about its deterministic index, i. e., the 
exact position in the index hierarchy of deterministic automata (deterministic index hier- 
archy). This question can be answered effectively. Here we follow the method introduced 
by Niwihski and Walukiewicz [7]. 

A path in an automaton is a sequence of states and transitions: 

ai.di <J2,d,2 <T n _i,ef„_i 
PO ► Pi ► • • ■ ► Pn ■ 

A loop is a path starting and ending in the same state, po — >p\ — ► . . . — >pq. A loop is 
called accepting if maxjrank(pj) is even. Otherwise it is rejecting. A j-loop is a loop with 
the highest rank on it equal to j. A sequence of loops A M A t _|_i, . . . , A K in an automaton is 
called an alternating chain if the highest rank appearing on Aj has the same parity as i 
and it is higher then the highest rank on Aj_i for i = l,l + 1, . . . , k. A (t, k) -flower is an 
alternating chain A M A t +i, . . . , \ K such that all loops have a common state q (see Fig. [2]). Q 
Niwihski and Walukiewicz use flowers in their solution of the index problem for deter- 
ministic word automata. 

Theorem 2.1 (Niwihski, Walukiewicz [7]). A deterministic automaton on words is equiv- 
alent to a deterministic (t, k)- automaton iff it does not contain a (t, k) -flower. □ 

For a tree language L over S, let Paths(L) C (Ex {0, 1}) W denote the language of generalised 
paths of L, 

Paths(L) = {((<7i,di), (cr 2 ,cfe), • • •) : ^gl Vj t(did 2 ■ ■ .<k-i) = Vi} . 

A deterministic tree automaton A, can be treated as a deterministic word automaton recog- 
nising Paths(L(y4)). Simply for A = (Q, Y*,qo,5, rank), take (Q, Sx{0, 1}, qo, 5' , rank), where 



This is a slight modification of the original definition from [7]. 
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Figure 3: A weak (1, 3) -flower. 



(p,(a,d),q) G 5' (p,a,d,q) £ 5. Conversely, given a deterministic word automaton 

recognising Paths(L(A)), one may interpret it as a tree automaton, obtaining thus a deter- 
ministic automaton recognising L(A). Hence, applying Theorem 12,11 one gets the following 
result. 

Proposition 2.2. For a deterministic tree automaton A the language L(A) is recognised 
by a deterministic (l, k,)- automaton iff A does not contain a (l, k)- flower. □ 

In [5] it is shown how to compute the weak deterministic index of a given deterministic 
language. An automaton is called weak if the ranks may only decrease during the run, 
i. e., if p — > q, then rank(p) > rank^). The weak deterministic index problem is to 
compute a weak deterministic automaton with minimal index recognising a given language. 
The procedure in [5] is again based on the method of difficult patterns used in Theorem 
12.11 and Proposition 12.21 We need the simplest pattern exceeding the capability of weak 
deterministic (l, K)-automata. Just like in the case of the deterministic index, it seems 
natural to look for a generic pattern capturing all the power of (l,k). Intuitively, we need 
to enforce the alternation of ranks provided by (l, k). Let a weak (c, K)-flower be a sequence 
of loops X L , A t+ i . . . , X K such that \j+± is reachable from Xj, and Xj is accepting iff j is even 
(see Fig. ED. 

Proposition 2.3 (|5j). A deterministic automaton A is equivalent to a weak deterministic 
(l, k)- automat on iff it does not contain a weak (l, k)- flower. □ 

For a deterministic language one may also want to calculate its nondeterministic index, 
i. e., the position in the hierarchy of nondeterministic automata. This may be lower than 
the deterministic index, due to greater expressive power of nondeterministic automata. 
Consider for example the language Lm consisting of trees whose leftmost paths are in 
a regular word language M. It can be recognised by a nondeterministic (1, 2)-automaton, 
but its deterministic index is equal to the deterministic index of M, which can be arbitrarily 
high. 

The problem transpired to be rather difficult and has only just been solved in [9]. 
Decidability of the general index problem for nondeterministic automata is one of the most 
important open questions in the field. 
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3. Topology 

We start with a short recollection of elementary notions of descriptive set theory. For 
further information see [3]. 

Let 2 aJ be the set of infinite binary sequences with a metric given by the formula 



Both 2 W and Ts, with the topologies induced by the above metrics, are Polish spaces (com- 
plete metric spaces with countable dense subsets). In fact, both of them are homeomorphic 
to the Cantor discontinuum. 

The class of Borel sets of a topological space X is the closure of the class of open sets 
of X by complementation and countable sums. Within this class one builds so called Borel 
hierarchy. The initial (finite) levels of the Borel hierarchy are defined as follows: 

• T*i(X) - open subsets of X, 

• Il9(X) - complements of sets from S2(X), 

• El +1 (X) - countable unions of sets from Ll2(X). 



For example, n§(X) are closed sets, S^X) are F a sets, and Il^pf) are G$ sets. By con- 
vention, 11° (X) = {X} and Egpf) = {0}. 



Even more general classes of sets from the projective hierarchy. We will not go beyond 
its lowest level: 

• E^(X) - analytical subsets of X, i. e., projections of Borel subsets of X 2 with product 
topology, 

• IlJ(X) - complements of sets from S}(X). 

Whenever the space X is determined by the context, we omit it in the notation above 
and write simply E§, LT^, and so on. 

Let (p : X — > Y be a continuous map of topological spaces. One says that (p is a reduction 
of A C X to B C Y, if V X £X x £ A «-> <^(x) € -B. Note that if is in a certain class of 
the above hierarchies, so is A. For any class C a set B is C-hard, if for any set A £ C there 
exists a reduction of A to -B. The topological hierarchy is strict for Polish spaces, so if a set 
is C-hard, it cannot be in any lower class. If a C-hard set B is also an element of C, then it 
is C- complete. 

In 2002 Niwihski and Walukiewicz discovered a surprising dichotomy in the topological 
complexity of deterministic tree languages: a deterministic tree language has either a very 
low Borel rank or it is not Borel at all (see Fig. [4]). We say that an automaton A admits 

a split if there are two loops p po — > . . . — ► p and p p\ — > . . . — > p such that 
the highest ranks occurring on them are of different parity and the higher one is odd. 

Theorem 3.1 (Niwihski, Walukiewicz [8]). For a deterministic automaton A, L(A) is on 
the level II3 of the Borel hierarchy iff A does not admit split; otherwise L(A) is H\-complete 
(hence non-Borel). □ 





and Ts be the set of infinite binary trees over S with a metric 

f 2 _min {l a: l : xe {°' 1 J'*' s ( x )^ t ( x )} iff s -t t 
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Figure 4: The Borel hierarchy for deterministic tree languages. 



An important tool used in the proof of the Gap Theorem is the technique of difficult 
patterns. In the topological setting the general recipe goes like this: for a given class identify 
a pattern that can be "unravelled" to a language complete for this class; if an automaton 
does not contain the pattern, then L(A) should be in the dual class. In the proof of the Gap 
Theorem, the split pattern is "unravelled" into the language of trees having only finitely 
many l's on each path. This language is Ll}-complete (via a reduction of the set of well- 
founded trees). 

In [5] a similar characterisation was obtained for the remaining classes from the above 
hierarchy. Before we formulate these result, let us introduce one of the most important 

technical notions of this study. A state p is replicated by a loop q\ ^-^ q2 — ► • • . — ► qi if 

there exist a path qi -^4 q' 2 — > . . . — > p such that do ^ d\ . We will say that a flower is 
replicated by a loop A if it contains a state replicated by A. The phenomenon of replication 
is the main difference between trees and words. We will use it constantly to construct hard 
languages that have no counterparts among word languages. Some of them occur in the 
proposition below. 

Theorem 3.2 (Murlak [5]). Let A be a deterministic automaton. 

(1) L(A) G £5 iff A does not contain a weak (0, 1) -flower. 

(2) L(A) G n{ iff A does not contain a weak (1,2) -flower. 

(3) L(A) £ S§ iff A does not contain a (1,2) -flower nor a weak (1,2) -flower replicated by 
an accepting loop. 

(4) L(A) G IIP, iff A does not contain a (0, l)-flower. 

(5) L(A) G S3 iff A does not contain a (0, 1) -flower replicated by an accepting loop. □ 



The notion of continuous reduction defined in Sect. [3] yields a preordering on sets. Let 
X and Y be topological spaces, and let A C X, B C Y . We write A <\y B (to be read "A is 
Wadge reducible to B"), if there exists a continuous reduction of A to B, i. e., a continuous 
function (p: X — > Y such that A = ip~ 1 (B). We say that A is Wadge equivalent to B, in 
symbols A =w B, if A <w B and A <w B. Similarly we write A <w B if A <\y B and 
B A. The Wadge ordering is the ordering induced by <w on the =vK-classes of subsets 
of Polish spaces. The Wadge ordering restricted to Borel sets is called the Wadge hierarchy. 

In this study we only work with the spaces and . Since we only consider finite E, 
these spaces are homeomorphic with the Cantor discontinuum {0, 1}^ as long as |S| > 2. In 
particular, all the languages we consider are Wadge equivalent to subsets of {0, 1}^ . Note 
however that the homeomorphism need not preserve recognisability. In fact, no homeomor- 
phism from Ts to {0, 1} W does: the Borel hierarchy for regular tree languages is infinite, but 
for words it collapses on A3. In other words, there are regular tree languages (even weak, or 
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deterministic), which are not Wadge equivalent to regular word languages. Conversely, each 
regular word language L is Wadge equivalent to a deterministic tree language L' consisting 
of trees which have a word from L on the leftmost branch. As a consequence, the height 
of the Wadge ordering of regular word languages gives us a lower bound for the case of 
deterministic tree languages, and this is essentially everything we can conclude from the 
word case. 

The starting point of this study is the Wadge reducibility problem. 

Problem: Wadge reducibility 

Input: Deterministic tree automata A and B 
Question: L(A) < w L{B)1 

An analogous problem for word automata can be solved fairly easy by constructing a tree 
automaton recognising Duplicator's winning strategies (to be defined in the next section). 
This method however does not carry over to trees. One might still try to solve the Wadge 
reducibility problem directly by comparing carefully the structure of two given automata, 
but we have chosen a different approach. We will provide a family of canonical deterministic 
tree automata A = {Ai : i £ 1} such that 

(1) given i,j € /, it is decidable if L(A{) <w L(Aj), 

(2) for each deterministic tree automaton there exists exactly one i E I such that L(A) =w 
L(Ai), and this i can be computed effectively for a given A. 

The decidability of the Wadge reducibility problem follows easily from the existence of 
such a family: given two deterministic automata A and B, we compute i and j such that 
L(A) = w L(Ai) and L(B) = w L(Aj), and check if L{Ai) < w L(Aj). 
More precisely, we prove the following theorem. 

Theorem 4.1. There exists a family of deterministic tree automata 

C = {C a : a e I}U{D a ,E a : a G J} 

with I = {a : < a < w w ' 3 + 2}, J = {n : < n < uj} U {uJ u) ' 2 a 2 + u w a x + n : a 2 < ui", < 
ct\ < uj w , n < uj} such that 

(1) for < a < (3 < u^' 3 + 2, whenever the respective automata are defined, we have 

L{C a ) L{Cp) 

\ / \ 

L(E a ) L{E P ) 

/ \ / 

L(D a ) L{D p ) 

where — ► means <w, and L(C a ) and L(D a ) are incomparable, 

(2) for each deterministic tree automaton A there exists exactly one automaton A' € C such 
that L{A') =w L{A) and it is computable, i.e., there exists an algorithm computing for 
a given A a pair (£, a) £ {C} x I U {D, E} x J such that L{A) =w L(E a ). 

The family C' satisfies the conditions postulated for the family of canonical automata A: 
for ordinals presented as arithmetical expressions over u> in Cantor normal form the ordinal 
order is decidable, so we can take {C} x / U {D, E} x J as the indexing set of A. 

Observe that the pair (S, a) computed for a given A can be seen as a name of the 
=v^-class of L(A). Hence, the set {C} x I U {D, E} x J together with the order defined in 
the statement of theorem provides a complete effective description of the Wadge hierarchy 
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restricted to deterministic tree languages. One thing that follows is that the height of the 
hierarchy is w 1 ^' 3 + 3. 

The remaining part of the paper is in fact a single long proof. We start by reformulating 
the classical criterion of reducibility via Wadge games in terms of automata (Sect. [5]). 
This will be the main tool of the whole argument. Then we define four ways of composing 
automata: sequential composition ©, replication — ►, parallel composition A, and alternative 
V (Sect. [6]). Using the first three operations we construct the canonical automata, all but 
top three ones (Sect. [7]). Next, to rehearse our proof method, we reformulate and prove 
Wagner's results in terms of canonical automata (Sect. [8|). Finally, after some preparatory 
remarks (Sect. [9|), we prove the first part of Theorem 14.11 modulo three missing canonical 
automata. 

Next, we need to show that our family our family contains all deterministic tree au- 
tomata up to Wadge equivalence of the recognised languages. Once again we turn to the 
methodology of patterns used in Sect. [2] and Sect. [3J We introduce a fundamental notion 
of admittance, which formalises what it means to contain an automaton as a pattern (Sect. 

[TT]) . Then we generalise — > to (t, /^-replication — in order to define the remaining three 
canonical automata, and rephrase the results on the Borel hierarchy and the Wagner hier- 
archy in terms of admittance of canonical automata (Sect. [121) . Basing on these results, we 
show that the family of canonical automata is closed by the composition operations (Sect. 
[T3j) . and prove the Completeness Theorem asserting that (up to Wadge equivalence) each 
deterministic automaton may be obtained as an iterated composition of G\ and D\ (Sect. 
[T4"|) . As a consequence, each deterministic automaton is equivalent to a canonical one. From 
the proof of the Completeness Theorem we extract an algorithm calculating the equivalent 
canonical automata, which concludes the proof of Theorem 14.11 

5. Games and Automata 

A classical criterion for reducibility is based on the notion of Wadge games. Let us 
introduce a tree version of Wadge games (see |10| for word version). By the nth level of a 
tree we understand the set of nodes {0, l} n_1 . The 1st level consists of the root, the 2nd level 
consists of all the children of the root, etc. For any pair of tree languages L C Tj^ , M C T% 2 
the game Gw(L, M) is played by Spoiler and Duplicator. Each player builds a tree, ts € Tej 
and tr> € Ts 2 respectively. In every round, first Spoiler adds some levels to ts and then 
Duplicator can either add some levels to tp or skip a round (not forever) . The result of the 
play is a pair of full binary trees. Duplicator wins the play if ts € L <4=> tjj € M. We say 
that Spoiler is in charge of L, and Duplicator is in charge of M. 

Just like for the classical Wadge games, a winning strategy for Duplicator can be easily 
transformed into a continuous reduction, and vice versa. 

Lemma 5.1. Duplicator has a winning strategy in Gyy(L, M) iff L <w M. 

Proof. A strategy for Duplicator defines a reduction in an obvious way. Conversely, suppose 
there exist a reduction t i— > <p(t). It follows that there exist a sequence n/~ (without loss 
of generality, strictly increasing) such that the level k of ip(t) depends only on the levels 
1,2, ... ,TOfc of t. Then the strategy for Duplicator is the following: if the number of the 
round is n^, play the kth level of to according to ip; otherwise skip. □ 
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We would like to point out that Wadge games are much less interactive than classical 
games. The move made by one player has no influence on the possible moves of the other. Of 
course, if one wants to win, one has to react to the opponent's actions, but the responses need 
not be immediate. As long as the player keeps putting some new letters, he may postpone 
the real reaction until he knows more about the opponent's plans. Because of that, we will 
often speak about strategies for some language without considering the opponent and even 
without saying if the player in charge of the language is Spoiler or Duplicator. 

Since we only want to work with deterministically recognisable languages, let us redefine 
the games in terms of automata. Let A, B be deterministic tree automata. The automata 
game G(A, B) starts with one token put in the initial state of each automaton. In every 
round players perform a finite number of the actions described below. 

Fire a transition: for a token placed in a state q choose a transition q — ► Qi,q2, take the 

old token away from q and put new tokens in q\ and 52 • 
Remove: remove a token placed in a state different from _L. 

Spoiler plays on A and must perform one of these actions at least for all the tokens produced 
in the previous round. Duplicator plays on B and is allowed to postpone performing an 
action for a token, but not forever. Let us first consider plays in which the players never 
remove tokens. The paths visited by the tokens of each player define a run of the respective 
automaton. We say that Duplicator wins a play if both runs are accepting or both arc 
rejecting. Now, removing a token from a state p is interpreted as plugging in an accepting 
subrun in the corresponding node of the constructed run. So, Duplicator wins if the runs 
obtained by plugging in an accepting subrun for every removed token are both accepting 
or both rejecting. 

Observe that removing tokens in fact does not give any extra power to the players: 
instead of actually removing a token, a player may easily pick an accepting subrun, and in 
future keep realising it level by level in the constructed run. The only reason for adding 
this feature in the game is that it simplifies the strategies. In a typical strategy, while some 
tokens have a significant role to play, most are just moved along a trivially accepting path. 
It is convenient to remove them right off and keep concentrated on the real actors of the 
play. 

We will write A < B if Duplicator has a winning strategy in G(A, B). Like for languages, 
define A = B iff A < B and A > B. Finally, let A < B iff A < B and A ~£ B. 

Lemma 5.2. For all deterministic tree automata A and B, 

A < B <==> L(A) < w L(B). 

Proof. First consider a modified Wadge game G' W (L, M), where players are allowed to build 
their trees in an arbitrary way provided that the nodes played always form one connected 
tree, and in every round Spoiler must provide both children for all the nodes that were 
leaves in the previous round. It is very easy to see that Duplicator has a winning strategy 
in G' W (L, M) iff he has a winning strategy in Gw(L, M). 

Suppose that Duplicator has a winning strategy in G(A, B). We will show that Duplica- 
tor has a winning strategy in G' W (L(A), L(B)), and hence L(A) <w L(B). What Duplicator 
should do is to simulate a play of G(A, B) in which an imaginary Spoiler keeps constructing 
the run of A on the tree ts constructed by the real Spoiler in G' W (L(A), L(B)), and Duplica- 
tor replies according to his winning strategy that exists by hypothesis. In G' W (L(A), L(B)) 
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Duplicator should simply construct a tree such that £>'s run on it is exactly Duplicator's 
tree from G(A, B). 

Let us move to the converse implication. Now, Duplicator should simulate a play in 
the game G' W (L(A), L(B)) in which Spoiler keeps constructing a tree such that ^4's run 
on it is exactly the tree constructed by the real Spoiler in G(A,B), and Duplicator replies 
according to his winning strategy. In G(A, B) Duplicator should keep constructing the run 
of B on t|) constructed in the simulated play. □ 

As a corollary we have that all automata recognising a given language have the same 
"game power". 

Corollary 5.3. For deterministic tree automata A and B, if L(A) = L(B), then A = B.\3 

Classically, in automata theory we are interested in the language recognised by an 
automaton. One language may be recognised by many automata and we usually pick the 
automaton that fits best our purposes. Here, the approach is entirely different. We are not 
interested in the language itself, but in its Wadge equivalence class. This, as it turns out, is 
reflected in the general structure of the automaton. Hence, our main point of interest will 
be that structure. 

We will frequently modify an automaton in a way that does change the recognised 
language, but not its =vi^-class. One typical thing we need to do with an automaton, is to 
treat it as an automaton over an extended alphabet in such a way, that the new recognised 
language is Wadge equivalent to the original one. This has to be done with some care, since 
the automaton is required to have transitions by each letter from every state. Suppose 
we want to extend the input alphabet by a fresh letter r. Let us construct an automaton 
A T . First, if A has the all-rejecting state _L, add a transition _L — ^ _L, _L. Then add an 
all-accepting state T with transitions T —>■ T, T for each a € £ U {r} (if A already has 
the state T, just add a transition T — ^ T, T). Then for each p £ {_L, T}, add a transition 

Lemma 5.4. For every deterministic tree automaton A over £ and a letter r S, A = A T . 

Proof. It is obvious that A < A T : since A T contains all transitions of A, a trivial winning 
strategy for Duplicator in G(A,A T ) is to copy Spoiler's actions. Let us see that new tran- 
sitions do not give any real power. Consider G(A T ,A). While Spoiler uses old transitions, 
Duplicator may again copy his actions. The only difficulty lies in responding to a move that 
uses a new transition. Suppose Spoiler does use a new transition. If Spoiler fires a transition 
p — ^ T, T for a token x in a state p / 1, Duplicator simply removes the corresponding 
token in p, and ignores the further behaviour of x and all his descendants. The only other 
possibility is that Spoiler fires _L — ^ _L, !_. Then for the corresponding token Duplicator 
should fire _L _L, _L for some a G S. The described strategy is clearly winning for Du- 
plicator. □ 

An automaton for us is not as much a recognising device, as a device to carry out 
strategies. Therefore even two automata with substantially different structure may be 
equivalent, as long as they enable us to use the same set of strategies. A typical thing we 
will be doing, is to replace a part of an automaton with a different part that gives the same 
strategical possibilities. Recall that by A q we denote the automaton A with the initial state 
changed to q. For q 6 Q let A q:= B denote the automaton obtained from a copy of A and 
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Figure 5: The alternative A VB, the parallel composition A A 5, and the replication ^4 ^ 
(transitions to _L and T are omitted). 



a copy of B by replacing each ^4's transition of the form p — q with p — q£. Note that 
A q:= A q is equivalent to A. 

Lemma 5.5 (Substitution Lemma). Let A, B, C be deterministic automata with pairwise 
disjoint sets of states, and let p be a state of C . If A< B, then C p —a < C p —b- 

Proof. Consider the game G(C p —a, C p —b) and the following strategy for Duplicator. In C 
Duplicator copies Spoiler's actions. If some Spoiler's token x enters the automaton A, Du- 
plicator should put its counterpart y in the initial state of B, and then y and its descendants 
should use Duplicator's winning strategy from G(A, B) against x and its descendants. 

Let us see that this strategy is winning. Suppose first that Spoiler's run is rejecting. 
Then there is a rejecting path, say ir. If on ir the computation stays in C, in Duplicator's 
run ir is also rejecting. Suppose ir enters A. Let v be the first node of ir in which the 
computation is in A. The subrun of Spoiler's run rooted in v is a rejecting run of A. Since 
Duplicator was applying a winning strategy form G(A, B), the subrun of Duplicator's run 
rooted in v is also rejecting. In either case, Duplicator's run is rejecting. 

Now assume that Spoiler's run is accepting, and let us see that so is Duplicator's. All 
paths staying in C are accepting, because they are identical to the paths in Spoiler's run. 
For every v in which the computation enters B, the subrun rooted in v is accepting thanks 
to the winning strategy form G(A, B) used to construct it. □ 



6. Operations 

It this section we introduce four operations that will be used to construct canonical 
automata representing Wadge degrees of deterministic tree languages. 

The first operation yields an automaton that lets a player choose between A and B. 
For two deterministic tree automata A and B over S, the alternative Ay B (see Fig. [5]) 
is an automaton with the input alphabet S U {a, 6} consisting of disjoint copies of A and 
B over the extended alphabet S U {a, 6}, A a ^ and B a ^, and a fresh initial state qo with 
transitions 

a A a u -r- b B a u , C —i— —i— r J. r n 

Qo — ► q , T , q — > q ' , T , and q — ► T, T for a f {a,b\ 

(only if L(A) = L(B) = put go -L, L). By Lemma l5.51 = is a congruence with respect 
to V. Furthermore, V is associative and commutative up to =. Multiple alternatives are 
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Figure 6: The sequential composition A © B. 



performed from left to right: 

A x y A 2 y A 3 \/ A 4 = ((Ax V A 2 ) V A 3 ) V A 4 . 

The parallel composition A A B is defined analogously, only now we extend the alphabet 
only by a and add transitions 

Qo Qo, Qo , and go T, T for a ^ a 

(only if L(A) = or £(-B) = 0, put go —> -L, -L). Note that, while in ^4 Vi? the computation 
must choose between A and -B, here it continues in both. Again, = is a congruence with 
respect to A. The language L(AAB) is Wadge equivalent to L(A)xL(B) and A is associative 
and commutative up to =. Multiple parallel compositions are performed from left to right, 
and for n > the symbol (A) n denotes A A . . . A A . 

n 

To obtain the replication A — > B, extend the alphabet again by {a, b}, set rank(go) = 1, 
and add and transitions 

qo-^>Qo,T, qo~^qo,qo, and q _L, 1 for a £ {a, b} . 

Like for two previous operations, = is a congruence with respect to — >. 

The last operation we define produces out of A and B an automaton that behaves 
as A, but in at most one point (on the leftmost path) may switch to B. A state p is 
leftmost if no path connecting the initial state with p uses a right transition. In other 
words, leftmost states are those which can only occur in the leftmost path of a run. Note 
that an automaton may have no leftmost states. Furthermore, a leftmost state cannot be 
reachable from a non-leftmost state. In particular, if an automaton has any leftmost states 
at all, the initial state has to be leftmost. For deterministic tree automata A and B over 
E, the sequential composition A® B (see Fig. [6J is an automaton with the input alphabet 
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ttl, 

Figure 7: The canonical (1, 4)-flower ^(1,4) ■ 

£ U {6}, where b is a fresh letter. It is constructed by taking copies of A and B over the 

extended alphabet SU{6) and replacing the transition p — h> r with p — q^ h for each 
leftmost state p and r £ {_L,T}. Like for A and V, we perform the multiple sequential 
compositions from left to right. For n > we often use an abbreviation nA = A ffi . . . © A. 



Observe that if A has a leftmost state, then a state in A © .B is leftmost iff it is a leftmost 
state of A or a leftmost state of B. It follows that the =-class of a multiple sequential 
composition does not depend on the way we put parentheses. An analog of © for word 
automata defines an operation on =-classes, but for tree automata this is no longer true. 
We will also see later that © is not commutative even up to =. 

The priority of the operations is ffi, A, V, — >. For instance A\ — ► A2 © A3 A A4 V A§ = 
A\ — > (((^2 © A3) A A4) V A$). Nevertheless, we usually use parentheses to make the 
expressions easier to read. 

Finally, let us define the basic building blocks, to which we will apply the operations 
defined above. The canonical (t, k) -flower Fu K \ (see Fig. [7|) is an automaton with the 
input alphabet {a t , a L+ \ . . . , a K }, the states q L , q L +i, . . . , q K where the initial state is q L and 
rank(gj) = i, and transitions 

Ql — ^5t,T, q t — >qj,T, qj — > q ,T , and qj — >T,T 

for j = 1 + 1, 1 + 2, . . . , k and k 7^ j. A flower Fu K \ is nontrivial if t < k. 

In the definitions above we often use an all-accepting state T. This is in fact a way of 
saying that a transition is of no importance when it comes to possible strategies: a token 
moved to T has no use later in the play. Therefore, we may assume that players remove 
their tokens instead of putting them to T. In particular, when a transition is of the form 
p — > q, T, it is convenient to treat it as a "left only" transition in which no new token is 
created, only the old token is moved from p to q. Consequently, when analysing games on 
automata, we will ignore the transitions to T. 
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7. Canonical Automata 

For convenience, in this section we put together the definitions of all canonical automata 
(save for three which will be defined much later) together with some very simple properties. 
More explanations and intuitions come along with the proofs in the next three sections. 

For each a < uj^' 3 we define the canonical automaton C a . The automata D a and E a 
will only be defined for < a < oj and a = uj w ' 2 oi2 + w w ai + n with < a± < oj m , ct2 < w", 
n < oj. All the defined automata have at least one leftmost state, so the operation © is 
always non-trivial. 

Let C\ = ^(o,o)) D\ = ^(1,1), and E\ = i*(o,o) V ^(1,1) ■ For 1 < a < w define 

C a = d ffi (a - , 
D a = Di © (a - l)Ei , 
E a = aEi . 

For w < a < u w we only define C a . Let C w = C\ — > C3 and C^k+i = C\ — ► (Ci © C^fc) 
for 1 < fc < cj. For every a from the considered range we have a unique presentation 
a = uArifc + uj lk ~ 1 nk-i + . . . + u/°no, with uj > l k > 0, l k > h-i > ■ ■ ■ > lo and < n, < cj. 
For lo = define 

C« = C no © niC^ij © ... © n k Cj k for odd n , 
C a = D no © niC^ © ... © n k Cj k for even n , 

and for Iq > set 

Ca = n C^ © niC^ © ... © nfcC^ifc . 

Now consider uj" < a < uj^' 2 . For k < uj let C^w+fc = i*(o,fc+i)> D^+k = ^(1^+2) 
and E^+k = i*(o,fc+i) V ^(1^+2) • For every a from the considered range we have a unique 
presentation a = + ao with ao, «i < ^ an d QT > 0. Let a\ = u> lk n k + u> lk - 1 n k _i + 

. . . + w'°no, with uj > l k > lk-i > ■ ■ ■ > lo and < nj < w. For ao = and Zo = 1 let 

C« = C^uj+Iq © niE^+h © ... © n k E^ w +i k , 
-D a = D^+io © niE^ +h © ... © n k E^+i k , 
E a = E u u>+i © niE^+h © ... © n k E^+i k , 
for ao = and Zo > 1 let 

C a = C^+iq © (n - l)E^+i © mE^+h © ... © n k E^+i k , 
-D Q = D uU +i © (n - l)E^+ k) © niE^+ h © ... © n k E w «,+i k , 
-Z?a = noE^+io © nxE^+h © ... © n k E u +i k , 



for a; > ao > let 



Ca = Ca ffi E^oii j 
-Da = -D ao ffi E^ ai , 
-^a = -Ea © E w uj ai , 



and for ao > w let 



C a — Cq ffi E,jjw ai . 

Finally consider oj w ' 2 < a < w w ' 3 . Let C^-i = C\ — > i*(o,2)> an d for A; < u; let 
C wW .2+fe+i = Ci — > (Ci ffi C^-2+fc). We have a unique presentation a = u> u) ' 2 a2 + + ao 
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with ao, cti,ct2 < w w and 02 > 0. Let Q2 = uj k nt + uj k - 1 rik-i + . . . + u °no, with uj > > 
h-i > ■ ■ ■ > Iq and < rii < uj. For ao = ol\ = let 

C a = noC^-2+io © © ... © nkC^u,-2+i k , 

for «o = and «i > let 







1 C^-2 a2 












1 Co;"- 2 a2 



for > cto > and cci = let 

C« = Ca © C^-2 a2 for odd a , 
C a = D ao © C w a,.2 a2 for even a , 
and in the remaining case (ao > w or «i > 0) let 

Let C denote the family of the canonical automata, i. e., 

C = {C a : a < uj^' 3 } U {D n ,E n : n < uj} U 

U {D^-2 a2+LO u ai+n , E^-2 a2+LO u ai+n : < ai < ^ , oi 2 < u) w , n < uj} . 

In the next three sections we will investigate the order induced on C by the Wadge ordering 
of the recognised languages. 

Now, let us discuss briefly the anatomy and taxonomy of the canonical automata. 
Simple automata are those canonical automata that cannot be decomposed with respect 
to ©, i. e., the automata on the levels uj k , uj UJ+k , and uj UJ ' 2+k for k < uj. Complex automata 
are those obtained from simple ones by means of 0. If for some automata A±, A2, ■ ■ ■ , A n 
we have A = A\ © A2 © . . . © A n , we call Ai components of A. If Ai are simple, they are 
called simple components of A, Ai is the head component, and A n is the tail component. 
Non-branching canonical automata are those constructed from flowers without the use of 
— >, i. e., C^a+n, D^a+n, E^a+n for a < uj u and n < uj. The remaining automata are 
called branching. The term head loop refers to any minimal-length loop around the initial 
state. If the head component of a canonical automaton is branching, then the automaton 
has only one head loop. Similarly, if the head component is C\ or D±. 

According to the definition of the automata game, in a branching transition a token 
is split in two. However in branching canonical automata, the role to be played by two 
new tokens is very different. Therefore, we prefer to see the process of splitting a token 
as producing a new token that moves along the right branch of the transition, while the 
original one moves left. Thus each token moves along the leftmost path from the node it 
was born in, bubbling out new tokens to the right. Let us prove the following simple yet 
useful property of those paths. 

Proposition 7.1. If a run constructed by a player in charge of a canonical automaton is 
rejecting, one of the tokens has visited a rejecting path. 

Proof. Observe that in a canonical automaton the only loop using right transitions is the 
loop around T. In other words, each path of the constructed computation that does not 
reach T goes right only a bounded number of times (depending on the automaton). Now, 
consider a rejecting run constructed during a play. It must contain a rejecting path ir. The 
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token created during the last right transition on tt visits a suffix of it, which of course is 
a rejecting path. □ 

Recall that we have defined the operation © in such a way, that the second automaton 
can only be reached via a leftmost path. This means that the only token that can actually 
move from one simple automaton to another is the initial token. Since passing between the 
simple automata forming a canonical automaton is usually the key strategic decision, we 
call the initial token critical, and the path it moves along, the critical path. 

Since we can remove the tokens from T with no impact on the outcome of the game, 
we can assume that in transitions of the form p — — > q, T or p — ► T, q no new tokens are 
produced, only the old token moves from p to q. The following fact relies on this convention. 

Proposition 7.2. If a player in charge of a canonical automaton produces infinitely may 
tokens, the resulting run is rejecting. 

Proof. We will proceed by structural induction. The claim holds trivially for non-branching 
automata. Suppose now that A = C\ — * A'. If the constructed run is to be accepting, the 
player can only loop a finite number of times in the head loop of A, thus producing only 
a finite number of new tokens. By the induction hypothesis for A 1 , those tokens can only 
have finitely many descendants. Hence, in the whole play there can be only finitely many 
tokens. 

Now, take A = A' © A" . Suppose there were infinitely many tokens in some play on A. 
Observe that all the tokens in A" are descendants of A'"s critical token. Hence, if there were 
infinitely many tokens in A" , by the induction hypothesis for A" the whole run is rejecting. 
Suppose there were infinitely many tokens in A'. Consider a play in which the critical token 
instead of moving to A" stays in the last accepting loop of A' (it exists by the definition of 
canonical automata). In such a play a run of A' is build. Since there are infinitely many 
tokens used, the run is rejecting by the induction hypothesis for A' . Consequently, the run 
of A constructed in the original play must have been rejecting as well. □ 



8. Without Branching 

In this section we briefly reformulate Wagner's results on regular word languages [15] in 
terms of canonical automata. For the sake of completeness, we reprove them in our present 
framework. 

The scenario is just like for tree languages: define a collection of canonical automata, 
prove that they form a strict hierarchy with respect to the Wadge reducibility, check some 
closure properties, and provide an algorithm calculating the equivalent canonical automaton 
for a given deterministic automaton, thus proving that the hierarchy is complete for regular 
languages. 

Since the non-branching canonical automata have only left transitions, they only check 
a regular word property on the leftmost path. It is easy to see that for each word language 
K, the language of trees whose leftmost branch is in K is Wadge equivalent to K. Based 
on this observation, we will treat the non-branching canonical automata as automata on 
words. 

Let Lr LjK \ denote the language of infinite words over + 1, . . . , k} that satisfy the 
parity condition, i. e., the highest number occurring infinitely often is even. 
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Lemma 8.1. For every index (l,k) and every deterministic tree automaton A of index at 
most (i, k), 

(1) L(A) < L M , 

(2) L(F( t)K )) =w L^ K ), 

(3) L (tA) < L (t)K /) iff (i,K) < (l',k'). 

Proof. A reduction showing (1) is given by w i— » rank(g , o)rank(gi)rank(g2) ■ • -, where qoQiQ2 ■ ■ 
is the run of ^4 on the word ty. 

For (2) the remaining reduction is obtained by assigning to a sequence n\ninj, . . . the 
tree with the word a ni a ni a n2 a n2 a n3 a n3 ... on the leftmost branch, and a a L elsewhere. 

Since can be recognised by a (t, k) automaton, one implication in (3) follows from 
(1). To prove the remaining one, it is enough to show that £( tjK ) % ^UTk)' us ^ x L 
and proceed by induction on k. For i = k the claim holds trivially: C T!m and Tm 
are not reducible to each other. Take t < k and let (l',k') = (t, k). Consider the game 
G K = Gw(Lu k \,L^i )K i\). As long as Duplicator does not play «/, Spoiler can follow the 
strategy from G K _i = Gy/ (-^( t , K -i); Dr If Duplicator never plays k', he loses. When 

Duplicator plays k', Spoiler should play k, and then again follow the strategy from G K _i, 
and so on. Each time, Duplicator has to play k' finally, otherwise he loses. But then he 
must play k' infinitely many times, and he loses to, since k' and n have different parity. O 

For the sake of convenience let us renumber the non-branching automata. For rj < uj w 

let 

Let C = {C a ,D a , E a :l<a< uj"}. 
Proposition 8.2. For < a < j3 < uj 1 ^ we have 

C a Cp 

\ „ / \ 

E a Ep 

/ \ / 

Da Dp 

where — ► means <. Furthermore, C a % D a and D a j£ C a . 

Proof. First, observe that C a < E a : a winning strategy for Duplicator in G(C a ,E a ) is 
to move the initial token to F^ ^, and then simply copy Spoiler's actions. Analogously, 
D a < E a . 

Let us now suppose that f3 = u> k for some k < u>. Then a = lj ^n^i + . . . + uq. By 
definition, E a , has index at most (0, k). Hence, by Lemma |8.1| E a < i^o.fc) = Gp. If we 
increase the ranks in each i 7 "^,/) in E a by 2, we obtain an automaton with index at most 
(1, k + 1) recognising the same language. Hence, we also have E a < F^i^ k+1 ^ = D a . 

Now, consider the general case. We have a unique pair of presentations a = ui k m k + 
. . . + rriQ and (3 = io k n k + . . . + uq with n k > 0. Let i be the largest number satisfying 
mi y£ m. Since a < (3, rrii < ni . Thus we have E a = E ao ffi E^, Cp = Ep © C 7 , where 
7 = oj k m k + . . .+oj l nii, a = + . . . +m , 0o = u} l (rii—mi) +U) l ~ 1 rn,i-i + . . . + m . 

Consider the game G(E ao © E^,Ep © C 7 ). The strategy for Duplicator is as follows. First 
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move the token to the last ^(0,1) i n Gp a . Then follow the strategy given by the inequality 
E ao < F(p i), as long as Spoiler stays in E ao . If he stays there forever, Duplicator wins. If 
Spoiler moves to Ej, Duplicator should do the same and keep copying Spoiler's move from 
that moment on. This also guarantees winning. The proof for Dp is entirely analogous. 

In order to prove that the inequalities are strict it is enough to show that C a % D a and 
D a % C a . We only prove that C a % D a ; the proof for D a C a is entirely analogous. Let 
us proceed by induction. The assertion holds for a = 1: the whole space is not reducible to 
the empty set. Let us take a > 1. By the definition, C a = i^o.fc) ffi Eyi D a = ffi Ej, 

where a = u + 7. Consider the game G(F^ ^ ffi ^7,-^(1,^+1) © E^). We have to find 
a winning strategy for Spoiler. If Duplicator never leaves Spoiler can stay in Fiqja 

and win using the strategy given by the Lemma 18.11 (3). Otherwise, after Duplicator enters 
Ky, he must make choice between C 7 and Z? 7 . Spoiler should loop in any loop of i^Ufc) 
waiting for Duplicator's choice. When Duplicator chooses one of C 7 , l) 7 , Spoiler should 
choose the other one and use the strategy given by the induction hypothesis. □ 

The third step is proving closure by natural operations. For word automata only the 
operations © and V make sense. The operation V is defined just like for trees. To define 
ffi, simply assume that all states are leftmost. It is easy to see that = is a congruence with 
respect to © and A. Both operations are associative up to =. 

Proposition 8.3. For each A\,A 2 € C, one can find in polynomial time automata Ay, G 
C such that A\ V A 2 = A y and A x © A 2 = A 9 . 

Proof. Closure by V is easy. For A x > A 2 it holds that A\ V A 2 = A%. Indeed, A\ \/A 2 > Ai, 
as A\ V A 2 contains a copy of A\. For the converse inequality consider G{A\ V A 2 ,A\). In 
the first move, Spoiler moves his initial token either to Ai or to A 2 . If Spoiler chooses 
A±, Duplicator may simply mimic Spoiler's actions in his copy of A±. If Spoiler chooses 
A 2 , Duplicator wins by applying the strategy from G(A 2 , A\), guaranteed by the inequality 
A ± > A 2 . 

In the remaining case A± and A 2 are incomparable. But then A\ = C a , A 2 = D a for 
some a < uj u (or symmetrically). It is very easy to see that C a V D a = E a . 

Let us now consider Ai © A 2 . Since ffi is associative up to = and only depends on the 
=-classes of the input automata, it is enough to prove the claim for simple A\\ in order to 
obtain a canonical automaton for (A^ffi. . .ffi J 4[ n) )ffi J 4 2 , take A^ ®{Af ) ffi. . . (A^ n) ffiA 2 ) . . .). 
Let us first consider A\ = C u k. Observe that if C^u > B, C u k ffi B = C^k. It is enough to 
give a strategy for Duplicator in GiC^k ® B, C^k), since the other inequality is obvious. To 
win, Duplicator should first copy Spoiler's actions, as long as Spoiler stays in C^k. When 
Spoiler moves to B, Duplicator should simply switch to the strategy from G{B ,C u k). 

Using the property above, we easily reduce the general situation to one of the following 
cases: C^k ©CL^fc+i, C^k ®D u k, or C^k (BE^k. In the third case, the automaton is already 
canonical. Let us calculate the result in the first two cases. 

In the first case we have C^k ffi CL,k+i = CL>*+i • Consider the game G(C u k ffi 
G VLU k+i,C vljJ k+i). Let C^i be the head component of C^k+i. It holds that I > k. In 
order to win the game, while Spoiler stays inside C^k, Duplicator should stay in C u i and 
use the strategy from G(C u k,C u i). When Spoiler enters CL^+i, Duplicator may simply 
copy his actions. The converse inequality is trivial. 
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Figure 8: An initial segment of the Wagner hierarchy 

In the second case there are two possibilities. If the head component of D^k is D^i 
with I > k, proceeding as before one proves C u k ffi D^k+i = D^k+i. But if I = k, we have 
C u k ffi D 'k = C vaj k +UJ k. Consider the game G{C rjU kj rtl3 h,C w k ffi D^k). While Spoiler stays 
in C^k, Duplicator should copy his actions. When Spoiler leaves C u k, he has to choose 
between D w k and the next copy of C w k . If he chooses D u k , Duplicator also moves to his 
copy of D^k, and mimics Spoiler actions. Suppose Spoiler chooses C^k. Then Duplicator 
stays in his head component, and mimics Spoiler's actions, as long as he stays in C u h. When 
Spoiler leaves C u k, he enters the initial state of E^^k, where rf + 1 = 77. Duplicator should 

exit C w k , go past D^k , and enter his copy of E^^k . From now on, he can copy Spoiler's 
actions. 

For A\ = D^k, simply dualise the claims and the proofs. For A\ = E^k, note that 
E u k ffi ^2 = C^k ffi V D^k ffi A2, and the equivalent canonical automaton can be obtained 
by previous cases. □ 

Let us now see that the hierarchy is complete for word languages. 

Theorem 8.4. For each word automaton A one can find in polynomial time a canonical 
non-branching automaton B such that L(A) =w L(B). 

Proof. We will proceed by induction on the height of the DAG of strongly connected com- 
ponents of A. Without loss of generality we may assume that all states of A are reachable 
from the initial state. In such case, the DAG of SCCs is connected and has exactly one root 
component, the one containing the initial state of the automaton. 

Suppose that the automaton is just one strongly connected component. Let (t, k) be the 
highest index for which A contains a (t, /c)-flower. It is well defined, because if A contains 
a (0, fc)-flower and a (1, k + l)-flower, it must also contain a (0, k + l)-flower. By Theorem 
12.21 A is equivalent to a (l, «;)-automaton and so, by Lemma I8TT1 A < Fr tjK y On the other 
hand it is easy to see, that in G(Fr t K \,A), Duplicator may easily use the (i, re)-flower in A 
to mimic Spoiler's actions in Fu K \. Hence, A = Fn K \. 

Now, suppose that the DAG of SCCs of A has at least two nodes. Let X be the root 
SCC. Like before, let (l, k) be the maximal index such that X contains a (t, K)-flower. Let 
qi , . . . , q m be all the states reached by the transitions exiting X (the "initial" states of the 
SCCs that are children of X). Recall that A q is the automaton A with the initial state set 
to q. Let Bi be the canonical non-branching automaton equivalent to A Qi . It is easy to see 
that A = F {hK) ffi (Bi y B 2 V ... VB m ). ' □ 

9. The Use of Replication 

Branching automata are defined by iterating The significance of — > lies in the fact 
that closing the family of non-branching automata by this operation gives, up to Wadge 
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equivalence, almost all deterministic tree languages (only C w u>-3, C w w3 + i, and will 
be denned by means of a stronger replication). In particular, we will show that the operation 
A is not needed. In other words, — ► is everything that deterministic tree automata have, 
which word automata have not. Let us see then what the use of the operation — > is. 

There are two kinds of simple branching automata. The first one is obtained by iterating 
— > on C3, and generalises C n . Intuitively, C n = C\ ® (n — l)E\ lets a player in an automata 
game change his mind n — 1 times in the following sense. First, the player moves his (only) 
token along the head loop. The head loop is accepting, so if he keeps looping there forever, 
the resulting run will be accepting. But after some time he may decide that producing an 
accepting run is not a good idea. In such a case he can move to the rejecting loop in the 
first copy of E\. Later he may want to change his mind again, and again, until he reaches 
the last copy of E\. Now, when the player is in charge of = C\ — > C3 he can choose 
a number n < to, and looping in the head loop of C w produce n tokens in the head loop of 
his copy of C3. We will see that with those tokens it is possible to simulate any strategy 
designed for C n+ 2- In other words, C w offers the choice between C n for arbitrarily high 
n > 3. The automaton C w 2 = C\ — > [C\ ® (C\ — ► C3)) lets you choose the number of times 
you will be allowed to choose some C n , and so on. 

The second kind of simple branching automata, obtained by iterating — > on 
does the same with C^+n instead of C n . For instance, C^-i = C\ — ► C^+i lets the player 
choose any C^+n = (X>™ (see page [T8|) . and in consequence L(C w w2) is hard for the class 
of regular languages of words. 

Let us now see the proofs. The first lemma justifies the name replication. 

Lemma 9.1. For all automata A, B and all < k < to, 

(1) A -» B > (A -> B) A (B) k , 

(2) Ci -» B > (B) k . 

Proof. To see that (1) holds, consider G((A — > B) A (B) k ,A — > B). Spoiler's initial moves 
produce a token x in the head loop of A — > B, and tokens x%, ... ,X}~, each in a different 
copy of B. Duplicator should loop his starting token y around the head loop of A — > B 
exactly k times producing for each Xi a doppelganger yi and move them all to the initial 
state of B (see Fig. [9}. From now on y mimics x, and yi mimics Xi for i = 1, . . . , k. 
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For the proof of (2) it is enoug h to check that (Ci -» B) A {B) k > (B) k . Clearly 
Ci^B> Ci. By Lemma ESI (Ci -» B) A (B) k > G\ A (£) fc , and the claim follows. □ 

Next we need to calculate the value of (C3)" and (C LU ^+i) n . Apart from canonical (l, k)- 
flowers F(t,,K.)i we consider the following automata containing weak (t, ft)-flowers (see page 
ED: 

WF {0)n) = C 1 ®D l @C l ®Di®... i , WF {hn+1) = Pi Ci £ > i C\ . . ■ . 

n+1 n+1 

We will refer to these automata as weak (i, k) -flowers too. In fact, W-F(o,n) = Cn+i> 
W-^(i,ti+i) = D n+ \, but we find the notation convenient. 

A pair ^2) £ w x oj is called ewen if both and Z2 are even. Otherwise (*i , ^2) is 
odd Let [t, k] denote the set {1, 1 + 1, . . . , k} C with the natural order. Consider the 
set [i,tz] x [//,«/] with the product order: (xi,yx) < (0:2,2/2) if ^1 < ^2 and y\ < y 2 . For 
m = 0, 1 and n > m define an alternating chain of type (to, n), or (to, n)-chain, as a sequence 
(i m ,fa) < (x m +i,y m +i) < ••• < (x n ,y n ), such that (xj,^) is even iff i is even. Suppose 
we have a (to, n)-chain of maximal length in [1, k] x [l', k']. The parity of n is equal to the 
parity of (k, k'), as defined above, for otherwise we could extend the alternating chain with 
(k, k 1 ) and get a (to, n + l)-chain. Consequently, the following operation is well-defined: 

(t, k) A (i ; , k!) = the type of the longest alternating chain in [l, k] x [//, k'] . 
Lemma 9.2. For all indices and (i 2) re 2) it ZioWs t/iai 

El AC 1 77 

r (llAl) M - r (t2,K2) ^ - r (tl,Kl)A(t2,«;2) ' 

^(ti./ci) A W-F( t2 , K2 ) = WF ((lAl)A(l2A2) . 

in particular, (F( ^)) k = F(o,2fc) ar1 ^ (W^ ? (0,2))' c = W^-F(o,2fc)- Equivalently, (C^+i) k = 
C UJ u+i+2k and (Cz) k = C^fc+i- 

-Proo/- By Lemma EU L(F {i)j) ) = w L {iJ) , so L(F (tl)Kl) A F( t2)/t2 )) =w £( tl , Kl ) x L (t 2 ,K 2 ), 
where L x M = {{x\, yi)(x2, 2/2) ■ • ■ : £1^2 • • • 6 UiV2 ■ ■ ■ G We will show that 

£(ix,kx) x L (i 2 A2) =W where (t, «) = (ti,Ki) A (t 2) K 2 ). 

Consider the following automaton A. The state space is the set 

X [t2,K 2 ] ~> {0,1,2} 

and the initial state is the function constantly equal 0. The transition relation 5 is defined 
as (/, <t, g) G 5 iff for all z and j, (f(i,j),cr,g(i,j)) G where Sua is defined as 

oMl, for all k + i , 
1^2, 1 ^ 1 for all fc ^ j , 
2^1, 

with * denoting any letter. 

Let us now define the rank function. For i € and j E [^-2^2], let (</,«/) = 

(tl,i) A (i2,i) and rank(i,j) = re'. Observe that t! = 1, so 1 < re' < re. Set the rank 
of the states that never take the value 2 to l. For the remaining states set the rank to 
rank(maxfc ifc, maxfc jfc), where (12,^2), ■ ■ ■ > (v>ir) are the arguments for which the 

value 2 is taken. 
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Figure 10: The weak flower WFr Q\ formed by the leftmost states of C\ ffi C w m+i. 3 . 

Let us check that the automaton recognises Lu 1>K1 \ x Lu 2jK2 \. Take a word w = 
(xi, yi)(x2, 2/2) • • • • Let x = max^x/j and y = maxtj/jj. In the run of A on w, the states 
/ satisfying f(x,y) = 2 will occur infinitely often. Furthermore, from some moment on 
there only appear states / satisfying V^/y) f(x',y') = 2 (x',y') < (x,y). Since 

(x, y) < (x',y') ==>■ rank(x,y) < rank(x',y / ), the highest rank used infinitely often in the 
run on w is rank(x,y). Finally, rank(x,y) is even iff x and y are even, so the run on w is 
accepting iff w € X £( i2)/t2 )- 

Since A has the index (1, re), the automaton itself provides a reduction of Lr bl K1 \ xL( t2)K2 ) 
to i( t)/t )- 

By definition of (t, n), there exists a sequence of pairs 

(x b ,y L ) < (x t+ i,y t+ i) < ... < (x K ,y K ) 

such that for all i it holds that l\ < X{ < Ki, i 2 < 2/j < K2> an d and are even iff i is 
even. The reduction is given by the function 

ip(hi 2 i 3 ■■■) = (xi^yi^ixi^yi^ixi^y^) .... 

The proof for weak flowers is entirely analogous. □ 

Lemma 9.3. For all < k,l < to and all m < to 

Cl © C^k A Cl® C^m; = Cl © C w m(fc_|_J) , 
Cl © C^w-2+mfc A Cl © = Cl © C w u-2+mj^() . 

In particular, (Ci © C tlJ m) fc = Ci ffi C w m fc anc? (Ci ffi C^u,-i+ m ) k = C\ © C^.2+m fc . 

Proof. Consider G(Ci ffi C UJ m k A Ci ffi C w m ; , Ci ffi C lJJ m k ffi C w ™, ; ). Observe that Duplicator's 
critical token will move along a copy of WF^^k+21) formed by the leftmost states of con- 
secutive copies of C^m (see Fig. [T0]h Spoiler's initial token splits in the first move in two 
tokens which continue moving along WF^^k) an d WFiq^I)- For the purpose of this proof, 
call them both critical. 

The strategy for Duplicator is based on the fact that WF^ 2 k) A WF( 2 i) = WF( 2 k+2i) 
(Lemma I9.2() . Duplicator can loop his critical token inside an accepting loop as long as 
both Spoiler's critical tokens loop inside accepting loops. When Spoiler changes his mind 
and moves one of them to a rejecting loop, Duplicator should move to a rejecting loop too, 
and keep looping there until both Spoiler's tokens are again in accepting loops. This can 
only repeat k + I times, so Duplicator is able to realise this strategy. 

This way, whenever Spoiler produces a new token x using one of the critical tokens, 
Duplicator can produce its doppelganger y. The role of the doppelganger is to mimic the 
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original. The mimicking is in fact passed from generation to generation: if the original 
token bubbles a new token x', y should bubble a new doppelganger y' which is to mimic x' , 
and so on. 

In order to see that the strategy is winning it is enough to observe two facts: Duplicator's 
critical token stays in a rejecting loop forever iff one of Spoiler's critical tokens does, and 
the sequence of ranks seen by any of Spoiler's non-critical tokens is equal to the one seen 
by its doppelganger. Hence, C\ © C u) m k A C\ © C w m{ < C\ © C^m^+iy 

The converse inequality is proved in a similar way and for the second equivalence the 
same proof works. □ 

Corollary 9.4. For all 1,l,k < u and all < n < uj 

(1) C u > WF^, C^i+i > C u i n , 

(2) > F^ K ), C^u-2+i+i > 

Proof. Since C u = C\ — > C3 = C\ — > WFf ^, by Lemma [9.11 and Lemma [9.21 we get 
Coj > (WF(Q£) ) m = VFF( 0i 2m) an d by the strictness of the hierarchy for word languages 
C u > WFt t)K y Similarly, using Lemma [9H and Lemma [931 we get C^i+i > (C\ © C^i) 71 = 
C\ © C^i n > C w i n . The remaining two inequalities are analogous. □ 



10. Automata in Order 

Let us start examining the order on canonical automata with the following simple 
observation. 

Lemma 10.1. For all < a < lo^ 

Ca 5; , C a < ■ 

Proof. We give a proof for the first inequality; the second one is proved analogously. Con- 
sider the following strategy for Duplicator in G(C a , C u ^). In every move, if any of Spoiler's 
tokens is inside a rejecting loop, Duplicator should move his critical token around a 1-loop, 
otherwise he should loop around the 0-loop. Let us see that the strategy is winning. 

By Proposition 17.21 if Spoiler's run is to be accepting, he must produce only finitely 
many tokens. All of those tokens must finally get to some 0-loop, and stay there forever. 
This means that after some number of moves, all Spoiler's tokens are in 0-loops which they 
will never leave later. But from this moment on Duplicator's critical token will keep looping 
around the 0-loop, so Duplicator's run will also be accepting. 

By Proposition 17.11 if Spoiler's run is to be rejecting, there must be a token that from 
some moment on stays forever in a 1-loop. Then Duplicator's token will also get trapped 
in the rejecting loop in C w w, and Duplicator's run will be rejecting too. □ 

Let us now see that we can restrict the way the players use non-critical tokens. For 
a simple automaton A and a canonical automaton B = B\ © . . . ffi B n with Bi simple, we 
say that B dominates A if one of the following conditions holds 

• A is non-branching 

• A = Ci -> C a , Bx = d -> C fi , and (3 > a, 

• A = C w m and B\ = or B\ = F (t k) V for t < K. 
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Lemma 10.2. Let A%, A2, ■ ■ ■ , A n be simple and let B be a canonical automaton dominating 
all A{. For every deterministic automaton C, if Spoiler has a winning strategy in G{A\ © 
. . . © A ri © B,C), then he also has a strategy in which he removes all non-critical tokens 
before entering B. Similarly for Duplicator in G(C, A% © . . . A n © B). 

Proof. Let B = B\ @ . . . @ B n with Bi simple. Suppose that at some moment the strategy 
tells Spoiler to enter B (if this never happens, the claim is obvious). If there are no non- 
critical tokens left in A\, A2, ■ ■ ■ , A n , then we are done. However if there are, we have to 
take extra care of them. Suppose Spoiler has produced non-critical tokens xi, . . . , x r ^ and Xi 
is in A mi . Since x% is not on a critical path of A mi , by the definitions of canonical automata, 
it will stay within a copy of C ai over the alphabet extended to the alphabet of B. 

Suppose B\ = C\ — * Cp. Since B dominates Aj, f3 > cti for all i. Spoiler should replace 
the token Xi with x\ and let x\ take over the duties of Xj. To produce x^, Spoiler should 
loop once in the head loop of B%. If B\ = C^k, or Ai = C^. 2 +k', Spoiler may simply move 
x\ to a copy of C ai and let it perform exactly the actions Xi would take. If j3 = C u u-2+k, 
oti = C^k', Spoiler should move x\ to the copy of -F(o,2) contained in Cp, and let it apply the 
strategy guaranteed by Lemma [10.11 To see that the strategy is applicable, it is enough to 
note that it does not require any waiting, and that -^(0,2) contains a copy of -^(0,1)- 

Suppose now that B\ is non-branching. Then, «j < uo^ for all i. In this case Spoiler 
cannot produce a token to take over x^s duties. Instead, he has to modify the actions of the 
critical token. He should move the critical token according to his original strategy moving 
from flower to flower, only when one of his non-critical tokens would be in a rejecting loop, 
he should choose a 1-loop in his current flower (instead of the loop suggested by the old 
strategy). Just like in the proof of Lemma flO.ll if in a play according to the original strategy 
one of the non-critical tokens stays forever in a rejecting loop, then in the game according 
to the new strategy the critical token finally also gets trapped in a 1-loop. Otherwise, there 
are only finitely many non-critical tokens, and all of them finally stabilise in an accepting 
loop. From that moment on, the critical token will see exactly the same ranks as it would 
see if Spoiler was playing with the original strategy. Hence, the modified strategy is also 
winning. 

If the original strategy brings Spoiler to a branching automaton, he should produce 
counterparts of his non-critical tokens just like above. □ 

Corollary 10.3. For every canonical automaton of the form A@B and every deterministic 
tree automaton C, if a Spoiler has a winning strategy in G(A © B,C), than he has also 
a winning strategy which removes all non-critical tokens before entering B. Similarly for 
Duplicator in G(C, A © B). 

Proof. Let A = A\@A2@. ■ -®A n with Ai simple. From the structure of canonical automata 
it follows that if A © B is canonical, B dominates A{ for i = 1, 2, . . . , n. □ 

Now we are ready to get back to the order on C. 

Lemma 10.4. If < a < j3 < w w ' 3 then C a < Cp and whenever D a and Ep are defined, 
D a < Ep, C Q < Ep. If 13 < a, then Ep < D a , Ep < C a . 

Proof. As an auxiliary claim let us see that if A(BB is canonical and A' > A, A@B < A'®B. 
Indeed, the following is a winning strategy for Duplicator in G(A(BB, A'®B). While Spoiler 
keeps inside A, apply the strategy from G(A, A'). If Spoiler enters B, by Corollary 110.31 we 
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may assume he removes all non-critical tokens. Hence, Duplicator may remove non-critical 
tokens, move the critical token to B and copy Spoiler's actions. 

Let us now see that C a < Cp for a < (3 < u/ 1 '' 3 ; the other inequalities may be proved 
in an analogous way. We will proceed by induction on (a, (3) with lexicographic order. If 
(3 < lu, the result follows by the word languages case. Suppose that uj < f3 < uj w . Let 
a = u; fc m/% + . . . + mo and [3 = uj + . . . + no, > 0. First, assume that m& = 0. 
Obviously C u k < Cp, simply because Cp contains a copy of C^k. If k = 1 the claim follows 
directly from Corollary [931 For k > 1, using the induction hypothesis and Corollary 19.41 we 
get C a < C u) k-i( rrik _ 1 +i) < C w k. Now, assume that > 0. Then a = uj k + a', (3 = uj k + /?' 
for some ordinals a' < (3' . By definition C a = C a > © C^k, Cp = Cp> © C^k, and by induction 
hypothesis, C a i < Cpi. Hence, by the auxiliary claim above, C a < Cp. 

Now, suppose that lo w < (3 < lo^' 2 . Let a = w^ai+ao, (3 = lo^Pi+Pq for ao, «i, Po, (3i < 
oj u . If a± = Pi, then by induction hypothesis C ao < Cp , and C a < Cp follows by the 
auxiliary claim above. Assume that ai < Pi. By Lemma 110. H C ao < C w ^. Replacing 
G(C ao ,Cp ) with G(Cq, ,C w w) in the above strategy, we get C a = C ao © E^ ai < C^w © 
B^d = Cu"( ai +i)- By Proposition 18.21 C^( Ql+1 ) < C^p x and since C^p x is contained 
in C^^^+^q, we get C^ ai+ao < C^p 1+ p Q . Observe that the argument works also for ao or 
Po equal to 0. 

The case uj^' 2 < P < c^' 3 is analogous to u> < P < uj 1 ^ . □ 

For a complete description of the ordering on the canonical automata (see Fig. [TT]) we 
need the strictness of the inequalities from the previous lemma. 

Theorem 10.5. Let < a < P < w w ' 3 . Whenever the respective automata are defined, 
it holds that D a ^ C a , D a ^ C a , D a < Ep, C a < Ep, and for a < P, C a < Cp, 
E a < Dp, E a < Cp. 

Proof. By Lemma 110.41 it is enough to prove C a < C a+ i, D a < E a , C a < E a , D a ^ C a , 
D a ^ C a . We will only give a proof of the first inequality; the others can be argued similarly. 
We will proceed by induction on a. If a < u, the claim follows by the word languages case. 

Suppose u < a < uj^ . Then a = uj k + a' with k > 1, a! < uj k+l . Let a' > 1 
(the remaining case is similar). We shall describe a winning strategy for Spoiler in G = 
G(C u) k +a / + i,C UJ k +a i). Spoiler should first follow the winning strategy for G(C Q / + i, C a i), 
which exists by the induction hypothesis. Suppose that Duplicator enters the head loop of 
C^k. We may assume that he removes all his non-critical tokens (Corollary 110.31) . Spoiler 
should remove all his non-critical tokens, move his critical token to any accepting loop in 
CV+i. Let us check that such a loop is always reachable for the critical token. 

Let C Q '+i = A(B B with B simple. If B = Ci — » B', Spoiler can move his critical token 
to C\. If B is not of this form, then by definition of canonical automata, Cq'+i = C2 n +i 
or CqZ+i = D 2n - Recall that C 2n +i = WF^^n) = WF^ 2n -i) © C\ and D 2n = WF^ >2n ^ = 
WFrypn-l) © Cl ( see page [22]) . It follows that in any play on C 2n+ i or D 2n , if one has a 
winning strategy, one also has a winning strategy never entering the rejecting loop of the 
tail component. Hence, the accepting loop in the tail component is always reachable (or 
has been reached already). 

Thus, Spoiler can move his critical token to an accepting loop in the tail component 
of C a > + i and loop there until Duplicator leaves the head loop. If Duplicator stays forever 
in the head loop of C u k, he loses. Suppose that Duplicator leaves the head loop of C u k 
after producing r tokens. The rest of the game is equivalent to C = G(Ci © C^kjA) for 
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G\ C2 Cu — Cuj + l . . . Cu^ + 1 Cu^+uj — Cuj^ +uj + l ■ ■ ■ 

\ / \ \ / \ 

E\ E 2 ••• E u u, E u a, +1 ■■■ 

/ \ / / \ / 

Figure 11: The Wadge ordering of the canonical automata. 

A = A\ A . . . A A r , where Aj is the part of C^k accessible for the Duplicator's j'th token. 
If k = 1, then Aj < WF^^) f° r each j. Hence A < WF^^r) an d by Corollary 19.41 Spoiler 
has a winning strategy in G' . Let us suppose k > 1. Then ^4, < C\ © C^k-i for j = 1, . . . , r 
and so, by Lemma l53| A < (Ci © C^k-if ■ Hence, by Lemma EH A < C u k-i r+ i. Since 

LO k-l r + 1 < UJ k-l r + 2 < 

oj k , we may use the induction hypothesis to get a winning strategy 
for Spoiler in G . In either case Spoiler has a winning strategy in G as well. 

Now, assume uj w < a < uj w ' 2 . Let a = cj^ai + ao with ao < oj w , 1 < «i < u w . Again, 
we describe a strategy for Spoiler in G = G(C a; " Q , 1+Qo+ i, C UJ Lj ai+OC0 ) only for ao > 1, leaving 
the remaining case to the reader. First follow the winning strategy from G(C ao +i, C ao ). If 
Duplicator does not leave the C ao component, he will lose. After leaving C af) , Duplicator 
has to choose D^ a ^ or C^ ai . Suppose he chooses D^ ai . Again, by Corollary |10.3l we may 
assume that he removes all non-critical tokens. Now, Spoiler has to remove all non-critical 
tokens and move the critical token to the initial state of E u u ai and use the winning strategy 
from G(E^ ai ,D^ ai ). 

For a = oj^^+k + a' argue like for a = u k + a'. □ 



11. Patterns in Automata 

Compare the notion of (t, K)-flower defined in Sect. [2] and the canonical flower F/ L>K y It 
is fairly clear that if A contains a (l, K)-flower, Duplicator can win in G(-F( t K ), A) by copying 
Spoiler's actions. In that case it seems plausible to look at A as if it "contained" a copy of 
Fu K y In this section we provide a notion which captures this intuition. 

' rp <A , a> 2 ,d> 2 a' m ,d' m , « „ 4', < „ . . ... 

1 wo paths p — > p\ — > . . . — > p m and p — > p[ — > . . . — > p„ m a deterministic 
automaton A are branching iff there exists i < min(m, n) such that for all j < % it holds 
that (a'j, d'j) = (a", d"), o[ = a'/, and d[ 7^ d". Note that the condition implies that p'j = p" 
for j < i. 

An automaton B can be embedded into an automaton A, if there exists a function 
e Q '■ Q B Q A an d a function e$ : Q B x S B x {0, 1} — > U A , where U A is the set of paths 
in A, satisfying the following conditions: 

(1) if p — > q and e s (p, a,d) = r — ► n — > ... — > r n then r = e Q {p), r n = e Q (q), 

(2) for all p, a the paths e$(p, a, 0) and es(p, cr, 1) are branching, 

(3) for every loop A in B, the corresponding loop in A (obtained by concatenating the paths 
assigned to the edges of A) is accepting iff A is accepting. 

For each tree automaton A, let A' be the automaton obtained from A by unravelling 
the DAG of strongly connected components into a tree (for the purpose of this definition, we 
allow multiple copies of _L). An automaton A admits an automaton B, in symbols B C A, 
if the automaton B' can be embedded into A. Note that if B can be embedded into A, then 
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Lemma 11.1. For all deterministic tree automata A and B 

AQB A<B. 

Proof. Since L(B') = L(B), without loss of generality we may assume that B = B'. We 
have to provide a winning strategy for Duplicator in G(B,A). Without loss of generality, 
we may assume that Spoiler never removes his tokens. Let eQ and e$ be the embedding 
functions. We will show that Duplicator can keep a collection of doppelg angers, one for each 
Spoiler's token, such that if some Spoiler's token x is in the state p, its doppelganger y is in 
the state eg(p). 

Let us first assume that eQ{q$) = q$ . Then the invariant above holds when the play 
starts. As long as Spoiler does not enter _L, the invariant can be maintained by means of 
the function es as follows. Suppose that Spoiler fires a transition q —> q', q" for some token 
x obtaining new tokens x' and x" . Let 

/ n N VlAl Ol-lA-l <*lA I a 'l + l' d 'l + l °"m: d m / 

e 5 {q,a,0) = p — ► . . . — > Pi-i — ► p x — > ... — > p m , 

, <?\A <Tl-lA-l a lA{ I, rf+lA'+l OnAn It 

e s {q,a,l)=po — >■■■ — > Pi-i — > p t — > ■■■ — > p n , 
with d\ = 1 - d'{. 

Let r^r'-^r'l be such that pi-\ r i: p'-_ x -^4 r'j, and p" k _ x ^ r'l for 1 < i < I, 

I + 1 < j < m, I + 1 < k < n, where d = 1 — d. 

Recall that we assume that for every transition, either both target states are _L or none. 
Since q' ^ _L and q" ^ _L then, by the condition (3) of the definition of admittance, p' m ^ _L 
and p'n 7^ _L and consequently all the states pi, ri, p'j, r'j, p'l, r'l are not equal to _L. Hence, 
Duplicator can proceed as follows. Starting with the token y (the doppelganger of x), fire 
the transitions forming the common prefix of both paths, each time removing the token sent 
to rj. Thus he reaches the state pi-i with a descendant of the token y. Then he should fire 
the next transition producing two tokens y' and y", and for each of them fire the remaining 
sequence of transitions (again removing the tokens in the states r'- and r'£). Thus he ends 
up with two tokens in the states p' m = eg(g') and p" n = eq(q"). Hence, the token in ec)(q') 
may be the doppelganger of x', and the token in eQ(q") may be the doppelganger of x" . 

Let us see that if Spoiler never enters L, Duplicator wins. Observe that the function 
es induces a function e from the set of infinite paths in B to the set of infinite paths in 
A. Owing to the condition (3), e(ir) is accepting iff ir is accepting. The strategy used by 
Duplicator guarantees that for each path tt in Spoiler's run, Duplicator's run contains the 
path e(w). The paths in Duplicator's run that are not images of paths from Spoiler's run 
were all declared accepting by removing the corresponding tokens. Hence, Duplicator's run 
is accepting iff Spoiler's run is accepting. 

Now, if Spoiler enters _L, Duplicator proceeds as before, only if some rj, r'j, or r'l is 
equal to ±, instead of removing the token from there (he is not allowed to do that), he lets 
the token and all its descendants loop there forever. In the end, again each path in Spoiler's 
run has a counterpart in Duplicator's run. The images of the rejecting paths (which exist 
in Spoiler's run), will be rejecting too. Hence, Duplicator also wins in this case. 

Finally we have to consider the situation when eQ{q$) / q$. In this case, Duplicator 
should first move his initial token to the state eQ(q$), removing the other tokens produced 
on the way whenever possible, and then proceed as before. □ 
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Another property that makes admittance similar to containment is transitivity. 

Lemma 11.2. For all deterministic tree automata A, B, and C , 

AQBQC => A CC. 

Proof. Again, we may assume that A' = A. Furthermore, since the states from one SCC 
have to be mapped to states from one SCC, then A can be embedded directly into B'. Hence, 
we may also assume that B = B' . Let eg' , ef ,Y be functions embedding the automaton 
X into Y. The embedding of A into C is simply a composition of two given embeddings: 
eg' = Cq ,C o eQ C , ef' C = e^' C o ef' B , where e^' C : Ii B — > Ii c is the function induced by 

e B ' C in the natural way. It is easy to see that eq C and ef' C satisfy the conditions from the 
definition of admittance. □ 

Embedding for automata on words is defined analogously, only the function eg is defined 
onQxS instead of Q x S x {0, 1}, and the condition (2) is dropped. Admittance is defined 
identically. The two lemmas above carry over with analogous proofs. 

12. Hard Automata 

In previous sections we have described an extended hierarchy of canonical automata. 
As we have already mentioned there are still three canonical automata left to define. In 
their definition we will make the first use of a stronger variant of the operation — >. 

In the operation >, instead of one (rejecting) loop replicating an automaton, we 
have a whole flower whose each loop replicates a different automaton. Recall that Fu K ) 
is an automaton whose input alphabet is {a L ,a L+ i . . . ,a K }, the states are 9 t ,9i+i, . . . ,q K , 
rank(gfj) = i, the initial state is q L , and transitions 

9i — ►gtjT, q t — >qj,T, qj — > q ,T , and qj — >T,T 

for j = l + 1, i + 2, . . . , k and k ^ j. Let A, A L , . . . , A K be deterministic tree automata over 

(t ft) 

S. The (i, k) -replication A — A L , . . . , A K (see Fig. [121) is obtained as follows. Take a copy 
of Fu K \ over the extended alphabet {a L , a t+ i . . . , a K } U S U {b}, where b is a fresh letter. Add 
single disjoint copies of A L , . . . , A K and A over the extended alphabet E U {a M a L+ \ . . . , a re } U 

{fe}. Finally, in F/ l)K \ over the extended alphabet, replace the transition q L r (where 

r € {_L, T}) with q L — q$, and q L ^* T with q L -^i q^ 1 for i = i, . . . , k. 

Using Lemma 15.51 it is easy to see that the =-class of the defined automaton depends 
only on (i, k) and the =-classes of A, A L , . . . , A K . Hence, = is a congruence with respect to 

— > tor every (t, n). 

Note also that A — > B and A i? are equal up to the names of letters and states. In 

particular L(A — > i?) =vk L(^4 i?). 

Let us now define the three missing automata. Let C u u>-z = C\ ^—l C\, C(o,2) an< i 

C w u-3 + i = Ci ^> ^(0,1) ■ The last automaton, C^-3 +2 consists of the states qo, q\, T with 
rank((/j) = i and transitions 

9o 9o,9i , 9o-^T,T, 
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Figure 12: The (1, 4) -replication A ^4 A 1 ,A 2 ,A 3 ,A 



Qi — ►go,T, gi — ► T, T . 
Using canonical automata we can formulate results from Sect. [3] in a uniform way. In the 
proof we will need the following technical lemma. 

Lemma 12.1. Let A be a deterministic tree automaton. For every productive state p in A 
there exists a state q, a path tt p from p to q, and pair of branching paths n®, tt^ from q to q 
forming accepting loops. 

Proof. Take an accepting run starting in p. For each node v of the run let S v be the set of 
states of the automaton that appear below v. Note, that if v' is a descendant of v, S v / C S v . 
Since all 's are non-empty, there exists a node u such that for all descendants v! of u, 
S u ' = S u . Pick a state q 6 S u . There exists a node w under u, labeled with q. Both wO 
and wl are labeled with S u , so there exist a nodes wq under wO and w\ under wl that are 
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also labeled with q. To conclude, let tt p be the path in A induced by the path from e to w, 
and let irl be the path induced by the path between w and Wi for i = 0, 1. 1 □ 

Theorem 12.2. Let A be a deterministic automaton. 

(1) L(Ci © Di) is Uncomplete; L(A) G i/f ^4 does no£ admii Ci © Z?i. 

(2) L(Di © Ci) is X!°- complete; L(A) G TI^ i/f ^4 does no£ admii Di © C\. 

(3) L(F (1)2) ) and L(Ci ^ (Z>i © Ci)) are Uncomplete; 

L{A) £Y,%iffA does not admit F (1)2) nor Ci ^1 (D x © Ci). 

(4) L(F(q 1 )) is £ 2 -comp/eie; -^(^4) £ n 2 ijff A does not admit ^(o.i)- 

(5) £(C w w3 + i) is U\- complete; L(A) G £3 i/f A does noi admit C^ 3 +1 . 

(6) -L(^4) G II3 i/f ^4 does no£ admit C^ 3 +2 . 

(7) L(C w ^ 3 +2 ) is Il\-complete; L(A) is Il\-complete iff A admits C £J w3 +2 . 

Proof. It is enough to check that for an automaton it is the same to contain the patterns 
from Theorem 13.21 (page [7]) and to admit the respective automata. It is straightforward to 
check that it indeed is so. The only difficulty is embedding the transitions to all-accepting 
states, but this is solved by Lemma ll2.il Let us just see the case of C w w3 +2 . If A admits 
C w w3 +2 , then the image of the two loops in C^ 3 +2 that contain the initial state is a split. 

Suppose that A contains a split consisting of an i-loop p p\ . . . p m+1 = p 

and a i-loop p p[ -^-t . . . p' n+ i = p, such that i is even, j is odd, and i < j. 

Without loss of generality we may assume that m, n > 1. Let p\ — > q , and let t p , t p i , t q i 
be the states guaranteed by Lemma ll2.1l for p, p\, and q' respectively. 

Let B be the automaton obtained from C a; ^-3_|_ 2 by unravelling the DAG of SCCs. The 
only way it differs from C a; ^-3 +2 is that instead of one state T it contains 5 all-accepting 
states Ti, . . . , T5, one for each transition from the root SCC: 

<7o <7o,<7i , <7o-^Ti,T 2 , 

a b 

Qi — ><?o,T 3 , gi — >T 4 ,T 5 . 
Define e Q (q ) = p, e Q {qi) = p[, eg(Ti) = eg(T 2 ) = tp, eg(T 3 ) = t q >, e Q (T 4 ) = e Q (T 4 ) = 
t p i . The function eg is defined as follows: 

e s (q ,a,0)=p — > p x — — >p, e s {qi, a, 0) = p\ — ^ . . — >p, 

e s (q , a, 1) = p ^ p[ , e s (qi,a, 1) = (p[ g') 71 "?' > 



e 5 (qr , &, 0) = TTpTrP , e s (qi,b, 0) 

e5(go, b, 1) = TTpTr^ , e s {qi,b, 1) = V^y 

e,5(T 4 ,*,0) =7T° 



'eo(Ti) ' 

e,5(T 4 ,*,l) =vr e 1 g(Ti) , 

where * denotes any letter and by 7ri7r 2 we mean the concatenation of two paths. Checking 
that this is an embedding is straightforward. □ 



^This elegant proof was suggested by one of the referees in place of a clumsier inductive argument. 
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For the proof of the next theorem, settling the position of the last canonical automaton, 
C^ 3, we will need the following property of replication. 

Lemma 12.3 (Replication Lemma). A state occurs in infinitely many incomparable nodes 
of an accepting run iff it is productive and is replicated by an accepting loop. 

Proof. If a state p is replicated by an accepting loop, then by productivity one may easily 
construct an accepting run with infinitely many incomparable occurrences of p. Let us 
concentrate on the converse implication. 

Let p occur in an infinite number of incomparable nodes vq,Vi,... of an accepting 
run p. Let 7Tj be a path of p going through the node vi. Since 2 W is compact, we may 
assume, passing to a subsequence, that the sequence 7Tj converges to a path tt. Since V{ 
are incomparable, Vi is not on it. Let the word on be the sequence of states labelling the 
path from the last common node of it and 7Tj to V{. Cutting the loops off if needed, we may 
assume that \a%\ < \Q\ for all i € lo. Consequently, there exist a word a repeating infinitely 
often in the sequence ao, ax,.... Moreover, the path it is accepting, so the starting state of 
a must lay on an accepting productive loop. This loop replicates p. □ 

Theorem 12.4. L^C^u-z) is Wadge complete for deterministic A3 tree languages. In par- 
ticular, A < C w m-z for each A £ C. 

Proof. Since C^w-z admits neither C u 

w ' 3 +2 nor Ccj^'^+i? -^(Ca;^' 3 ) is a deterministic lan- 
guage (Theorem I12.2p . Let us see that it is hard in that class. 

Take a deterministic automaton A recognising a A^-language. By Theorem MM (5), A 

does not admit C^-s+x = Cx — '-^ F(o,x) ■ Let us divide the states of A into two categories: 
a state is blue if it is replicated (see page [7]) by an accepting loop, otherwise it is red. Note 
that every state reachable from a blue state is blue. 

Let A' be the automaton A with the ranks of red states set to 0, and let A" be A 
with the ranks of the blue states set to 0. Let us see that A < A' A A" . The strategy for 
Duplicator in G(A, A' A A") is to copy Spoiler's actions in A, both in A' and A". To show 
that this strategy is winning it is enough to show that for each t a run of A on t is accepting 
iff the runs of A' and A" on t are accepting. Take a path tt of the run of A. Let tt' and tt" 
be the corresponding paths of the computations of A' and A". If tt only visits red states, 
then the ranks on tt and tt" are identical, and tt' contains only 0's. Otherwise, tt enters 
a blue state at some point, and then stays in blue states forever. In such case, the blue 
suffixes of tt and tt' have the same ranks, and the blue suffix of tt" contains only 0's. Thus, 
tt is accepting iff tt' and tt" are accepting and the claim follows. 

Since A does not admit Cx -^(0,1)1 ^ follows that all (0, l)-flowers in A are red. 
Consequently, A' does not admit -^0,1) > and so L(A') is LT^. Since L(F( 12 )) is LT^-hard 
(Theorem dil (3)), A' < F (1)2) . 

Now consider A" . Once you enter a blue state, you can never move to a red state. 
Consequently, since in A" all blue states have rank 0, we may actually replace them all 
with one all-accepting state T without changing the recognised language. Recall that, by 
convention, instead of putting tokens into T we simply remove them. Hence, when for some 
token in p a transition of the form p — > T, q or p — > q, T is fired, we imagine that the 
token is moved to q without producing any new tokens. By the Replication Lemma (Lemma 
112. 3[) the occurrences of red states in an accepting run may be covered by a finite number 
of infinite paths. Hence, by our convention, only finitely many tokens may be produced in 
a play if the constructed run is to be accepting. 
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Let us now show that Duplicator has a winning strategy in G(A", (C\ — Cx,Fn K \)), 
where (t, k) is the index of A. Whenever Spoiler produces a new token (including the starting 
token), Duplicator should loop once around the head 1-loop producing a doppelg anger in 
Fuk)) an d keep looping around the head 0-loop. The new token is to visit states with 
exactly the same ranks as the token produced by Spoiler. Let us see that this strategy 
works. Suppose Spoiler's run was accepting. Then, there were only finitely many red tokens 
produced, and hence the head 1-loop was visited only finitely often. Furthermore, each 
Spoiler's token visited an accepting path. But then, so did its doppelg anger, and Duplicator's 
run was also accepting. Now suppose Spoiler's run was rejecting. If infinitely many red 
tokens were produced, the head 1-loop was visited infinitely often, and Duplicator's run was 
also rejecting. If there were finitely many tokens produced, then one of the tokens must 
have gone along a rejecting path, but so did its doppelgdnger and Duplicator's run was also 

rejecting. Hence A" < (Cj Ci,F^ k) ). 

By Lemma E3J A' A A" < (C\ Ci,F M ) A F (lj2) , so it is enough to check that 

(Ci — I C\,Fu K \) A ^(1,2) — C u u-3. Consider the following strategy for Duplicator in the 

game G((C\ — > Cx,Ft t K \) A -F(i 2)> C^us). First, loop once around the 1-loop and produce 
a new token in -F(o,2) an d use it to mimic Spoiler's actions in ^(1,2) ■ Then, for each new 
token x Spoiler produces in his 1-loop and sends to Fi hK \, Duplicator should produce tokens 

yi, . . . ,y L K±ij in F (0)2 ). By Lemma E21 (^(0,2)^^ = -F (0 ,2L^J) < F( hK )i so Duplicator 

has a winning strategy in G(Fr^ K \, (i^(o,2))'" 2 "')■ Adapting this strategy, Duplicator can 
simulate the actions of Spoiler's token x in F^ K ^ with the tokens yi, ■ ■ ■ ,y, K +ii in ^(0,2) ■ 
If Spoiler loops the 1-loop without producing a new token, or loops around the 0-loop, 
Duplicator should copy his actions. Clearly, this strategy is winning for Duplicator. 

Finally, let us see that A < C^w-3 for each A E C. Take n < to. Observe that in 
no state is replicated by an accepting loop. Hence, may not admit C u u-s + i nor 

C^w3 + 2- By Theorem 112.21 £(C u u-a+i.) is in A3. By Lemma [10.41 for each A G C there 
exists m < lo such that A < Hence, for all A E C, L(A) E A3, and A < □ 

From Theorems ll2.2l and ll2.4l we obtain the following picture of the top of the hierarchy: 

C < C^ a < < ■ 

Let C' = C U C^w.34.2}. Note that it already follows that the height of the 

Wadge hierarchy of deterministic tree languages is at least (u^) 3 + 3. In the remaining of 
the paper we will show that each deterministic automaton is Wadge equivalent to one of 
the canonical automata from C , thus providing a matching upper bound. 

13. Closure Properties 

Our aim is to show that each deterministic tree language is Wadge equivalent to the 
language recognised by one of the canonical automata. If this is to be true, the family of 
canonical automata should be closed (up to Wadge equivalence) by the operations intro- 
duced in Sect. [6j In this section we will see that it is so indeed. The closure properties 
carry substantial part of the technical difficulty of the main theorem, whose proof is thus 
made rather concise. 
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Proposition 13.1. For A,B € C one can find in polynomial time an automaton in C 
equivalent to A\J B. 

Proof. Proceed just like for nonbranching automata (Proposition l8.3l page !19p . Take A,B& 
C. If A < B, then A V B = B and if B < A, then A V B = A. If A and B are incomparable, 
by Lemma 110.41 we get that they must be equal to D a and C a . It follows easily from the 
definitions of the canonical automata that D a V C a = E a . □ 

Proposition 13.2. For A,B € C one can find in polynomial time an automaton in C 
equivalent to A® B. 

Proof. Recall that simple automata are those that cannot be written as A\ © A2 for some 
canonical automata Ai,A 2 . Let us first assume that A is a simple branching automaton. 
First let us prove that for B < A, A ffi B = A. Let us consider the game G(A © B,A). The 
following is a winning strategy for Duplicator. While Spoiler keeps inside the head loop of 
A, mimic his actions. When he exits the head loop, let all the non-critical tokens produced 
so far copy the actions of their counterparts belonging to Spoiler, and for the critical token 
(and all new tokens to be produced) proceed as follows. If C\ © B is a canonical automaton, 
then, by the shape of the hierarchy, C\ © B < A and Duplicator may use the winning 
strategy from G{C\ © B,A). If C\ © B is not canonical, then B = Fr LjK \ © B' for some 
(l, k) 7^ (1, 1). It is very easy to see that C\ @Fu K \ @B' = Fn K \ ffi-B', and again Duplicator 
can use the winning strategy from G{C\ © B,A). 

Let us assume now that B = B\ ffi B2 © . . . ffi B n where B{ are simple and B\ > A. 
Suppose B = Cu"r) for some r\ < u;^' 3 . Then A © B = B. Indeed, consider the game 
G(A © B,B). While Spoiler keeps inside A, Duplicator should keep in B\ and apply the 
strategy from G(A, B\). Suppose Duplicator enters B. Since B\ > A, it holds that B 
dominates A and we may assume that Spoiler has removed his non-critical tokens before 
entering B. From now on Duplicator may simply mimic Spoiler's behaviour. 

An analogous argument shows that for B\ = D^ri, we get A(B B = B. For B\ = E^^, 
A © B is a canonical automaton (up to a permutation of the input alphabet). 

Now, consider B = B% ffi ... ffi B n > A, Bi simple and B\ < A. By the definition of 
canonical automata, B\ < B% < . . . < B n , and since B > A, B n > A. Let k be the least 
number for which B k > A. Let B' = B\ ffi ... ffi B k _ x and B" = B k © . . . ffi B n . In order 
to reduce this case to the previous one it is enough to check that A © B < A © B" (the 
converse inequality is obvious). Consider G{A © B, A ffi B"). While Spoiler's critical token 
stays inside A © B', Duplicator follows the strategy from G(A © B', A). If Spoiler does not 
leave A ffi B', he loses. Suppose that Spoiler finally enters B". Note that B" dominates 
A and B\, . . . ,B k -i- Hence, by Lemma Il0.2( we may assume that Spoiler removes all his 
non-critical tokens on entering B" . Duplicator should simply move his critical token to the 
initial state of B" and mimic Spoiler's actions. 

Suppose now that A = F^^ or A = F^^ V Let B = B% ffi . . . ffi B n with Bi 

simple. For 1 < k proceeding like in Proposition 18.31 (page [T9l) one proves that 

(1) B < A =^ A®B = A, 

(2) B x = Fj^ => A® B = F M ffi (F M V F^) © B 2 ffi . . . ffi B n G C, 

(3) A < B 1 = (F WtKl) V Fjjrjj) => A ffi B G C, ' 

(4) A < B l = F WiKf) =^ A®B = B, 

(5) A < B x = (Ci -» B[) =^ A(BB = B. 

In the remaining case, B% < A < B, argue like for branching A. 
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For l = k, the implications (2), (3), and (4) also hold, and give a canonical form if B\ 
is non-branching. If B\ is branching, A® B = B for A = Fnu, and A® B = F( 0Q \ ® B € C 
for A€{F m ,F m Vf (1)1) }. 

Finally let A = A\ ffi A 2 ffi . . . ffi _A r , where Aj are simple. Using the fact that © 
is associative up to =, and Lemma 15.51 (page [12]) . we get {A\ © A2 © ... © A r ) © B = 
(Ai ffi A 2 ffi ... ffi A r _i) © (A r © 5) = (Ai © A 2 © • • • © ^r-i) © B' where 5' is a canonical 
automaton equivalent to {A r © Repeating this r — 1 times more we obtain a canonical 
automaton equivalent to A © B. □ 

In the following proofs we will need the following property. For simple branching 
automata B = (C\ — > C a ), let B~~ = Di —> C a . 

Lemma 13.3. For every A € C and every simple branching B one can find in polynomial 
time a canonical automaton equivalent to B~ © A. 

Proof. B is simple branching, so B = C a where a = u k or a = oj UJ ' 2+k . Let A = S © A' , 
where A' € C and 5 is a simple automaton. Suppose first that S is a branching automaton. 
Then S = © d and A = © C x © A' with /? = u j or (3 = w u ' 2+i . Let us check that 
Cq © C/3~ © Ci © A' = C~ ax(a ^ ffi Ci ffi A'. Consider the following strategy for Duplicator in 
G(C~ © Cp © Ci © A', C max(Q ^ © Ci © A'). While Spoiler's critical token 2 can reach the 
head loop of C~ or , Duplicator may keep his critical token y looping in the head loop 
of his automaton C~ ax ( a ^ . For every new token produced by Spoiler in the head loop of 
C~ or Cp , Duplicator produces a doppelgdnger in the head loop of C 1 "^^ ^. When Spoiler 
moves his critical token x to C\ © A', Duplicator does the same with y and lets it copy x's 
actions. As the converse inequality is obvious, C 1 "^^ ^ © C\ © A' = C m£LX ( a ^ © A 1 gives 

the canonical form for C~ © A. 

Now, let S be non-branching. Suppose first that S is one of the automata D^+k, 
C^+u, E u u,+k for k > 0. If a = u k , C~ © 5 © A' < C a © S © A' = S © A' by the 
proof of the closure by ffi. The converse inequality is obvious. Similarly, if a = uj UJ ' 2+k , 
C a © S © A 5^ © $ © A = © A . The converse inequality is obvious again. 

The remaining possible values for S are C±, D\ and E\. If S = C\, C~®C\®A = C a ®A, 
and the canonical automaton is obtained via closure by ffi. For S = E\, observe that 
C~ ffi Ei ffi A' < C~ k ffi C 2 ffi A' = C^k ffi Di ffi A. By the proof of the closure by ffi we get 
C^k ffi Di ffi A = C^k ffi A. Hence C~ k ffi E x ffi A < C w k ffi A. The converse inequality is 
obvious. Finally, if S = D\, we get C~ ffi Z?i ffi A' = C~ ffi A'. By the structure of canonical 
automata, A' must start with E\ or C^u+k. In both cases we can use one of the previous 
cases to get an equivalent canonical automaton. 

If A = S the whole argument is analogous, only in the last case, for S = D\, we have 
C~®D 1 = D 1 . □ 

Proposition 13.4. For A,B € C one can find in polynomial time an automaton in C 
equivalent to A A B. 

Proof. We will proceed by induction on (A, B) with the product order induced by <. Let 
A = A 1 ®A 2 ®...®A m , B = B 1 ®B 2 ®...®B n with Aj, Bj simple. Let A' = A 2 ®. . .®A m 
for m > 1 and B' = B 2 ffi . . . ffi B n for n > 1. 

First, assume that B\ = C\ — > Cp, and either A\ = F^ K for some (l, k), or A\ = C\ — > 
C a for a < p. Let m, n > 1. Let us see that A A £ = ffi (A' A B V A A Ci ffi B'). In 
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the first move Spoiler produces token x in A and x in B. While x stays in A\ and x 
stays in the head loop of B±, Duplicator should keep his critical token in the head loop of 
By and for each x, a child of x B or x A , produce a token y whose task is to play against x. 
The token x after being produced is put in the head loop of Cp or, if A\ = C\ — > C a , in the 
head loop of C a . The token y is put in the head loop of Cp. Since a < f3, y can adapt the 
strategy from G(C a ,Cp) if x is in C a , or simply copy x's actions if x is in Cp. Now, two 
things may happen. If x A enters A' while x B stays in the head loop of Bi, Duplicator should 
move his critical token to A' A B and split it into y A sent to A 1 and y B sent to B. Then 
y A should mimic x A , and y B should mimic x B . If x B exits the head loop of B\, Duplicator 
should move to A A C\ ©£>', produce two tokens, and mimic Spoiler's actions. The converse 
inequality is even simpler. In a similar way we prove A A B = B^ ffi (A A C\ ffi B') for 
n > m = 1, A A B = B^ © (A' A B V ^ A Ci) for m > n = 1, and A A B = #f © (A A Ci) for 
m = n = 1. In all four cases using the induction hypothesis, the closure by V, ffi, and the 
Substitution Lemma (Lemma 15.51 P a g e fl2j) we obtain an automaton of the form Sj~ ffi C, 
where C is canonical. Lemma 113.31 gives an equivalent canonical automaton. 

Next, suppose that A\ — F^ K ^, B\ — F( L ' iK 'y Assume m,n > 1. Using Lemma [9^2] one 
proves easily that AAB = i ? ( ljK ) A (,.' A ') ffi {(A A B') V {A 1 A B)). Similarly, for m > 1, n = 1, 
we have A A B = i*( t)/e )A(t',«') © {A' A B) and the canonical form follows from the induction 
hypothesis. For m = 1, n > 1 proceed symmetrically. For m = n = 1, A A B = i ? ( tiK )A(t',K / )- 
Again, using the induction hypothesis, the closure by V, ffi, and the Substitution Lemma, 
we get an equivalent canonical automaton. 

The general case may be reduced to one of the special cases above, because E a A A = 
(C a A A) V (D a A A). □ 

Since (l, k) -replication requires a rather involved analysis, let us first consider — >. 

Proposition 13.5. For A,B € C one can find in polynomial time an automaton in C 
equivalent to A^> B. 

Proof. First, let us deal with two special cases for which the general method does not work. 
For B ^ C3 simple calculations give the following equivalences: A — > B = (D\ @A)AB for 
B e {C 1 ,E 1 ,C 2 ,D 2 ,D 3 }, A -»■ Di = AyD 1 ,A^E 2 = A -> D 3 . By the Substitution 
Lemma, the equivalent canonical forms follow from the closure by ffi, V, and A. 

The second special case is when B contains non-trivial flowers but B ^ -^(0,2) • First, let 
us see that A -> F( ,i) = (Di ffi A) A F( ,i). The inequality A -> F (0jl) > (D\ ffi A) A F( ,i) 
follows easily from Lemma 19.11 For the converse it remains to observe that the following 
strategy is winning for Duplicator in G(A — > ^(0,1)1 (-^1 © A) A F( 0) i)): in D\ ffi A mimic 
Spoiler and in ^(0,1) a Pply the strategy from G{C\ — > i^o 1) > -^(0,1) ) given by Theorem 112.21 
(3 and 4). An analogous argument shows that A — > -^(1,2) = (-^l ffi ^4) A ^(1,2)- For the 
remaining possible values of -B we will show ^4 — > B = (Di © A) A -F(o,i) A ^(1,2) ■ Again, 
A ^ B > (Di ® A) A B A B > (Di @ A) A F (01 ) A F( 12) is easy. For the converse, 
observe that B only uses ranks 1,2,3. Consider the following strategy for Duplicator in 
G(A — ► B, (Di ffi ^4) AF( 0] i) A-F( 1)2 )). In the component D\ (&A simply mimic the behaviour 
of Spoiler's critical token. In ^(0,1) use the strategy from G(C\ — > B',Fr 0il \), where B' 
denotes B with ranks 1 and 2 replaced by and rank 3 replaced by 1. In Fn. 2) use the 
strategy from G(C\ — > B",F(i t2 )), where jB" denotes i? with all 3's replaced by l's. The 
combination of these three strategies is winning for Duplicator. 
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For the remaining automata, we will show that what really matters is the maximal 
simple branching automaton contained in C\ — > B. There are two main cases: either 
C wfc -i < B < C^. (C 3 <B< C u for k = 1), or C^. 2+{ k-i) < B < (F (0 , 2 ) < B < 

for k = 1). In the first case ^4 — > B = C~ k ®A, in the second case A — » S = C~ w . 2+fc ©A 
Since the proofs are entirely analogous, we will only consider the first case. We only need 
to argue that A — > i? < Cj fc © A, since the converse inequality is obvious. 

Let us start with B = C^k. Denote the head loop of C^k by Ao- It is enough to show 
a winning strategy in G(A — > B, C~ k © A). Since no path from the head loop of A — > B 
to Ao goes through an accepting loop, Duplicator may keep his critical token in the head 
loop of C~ k as long as at least one of Spoiler's tokens can reach Ao- Hence, for every token 
produced by Spoiler in Ao, Duplicator can produce a doppelganger. When none of Spoiler's 
tokens can reach Ao any more, Duplicator moves his critical token to A and mimics Spoiler. 

Let us now suppose that C w k-i < B < C^k, k > 2 (for k = 1 the proof is very similar). 
The strategy for Duplicator in G(A — * B, C~ k © A) is as follows. Let m be such that B < 
C u k-i m . For every token xi produced by Spoiler using the head loop of A — > B, Duplicator 
produces m tokens y\ , . . . , yf 1 using the head loop of C~ k . Then the tokens y\ , . . . , y™ 
play against Xi simulating Duplicator's winning strategy from G(B, [C% © C^fc-i)" 1 ). When 
Spoiler moves his critical token to A, Duplicator does the same and keeps mimicking Spoiler 
in A. 

Thus we managed to simplify A — > B to C~ © A where a = u> k or a = uj u ' 2+k . An 
equivalent canonical automaton is provided by Lemma I13.3L □ 

Now we are ready to deal with (t, /-^-replication. Since C^s = C\ I C u ^+i, the class 
(i,k) 

C is not closed by However, adding the three top canonical automata is enough to get 
the closure property. 

Proposition 13.6. For A,A L , . . . , A K £ C, i, k < u, one can find in polynomial time an 
automaton in C equivalent to A A L , . . . , A K . 

Proof. Let B = A A L , . . . , A K . If B admits any of the automata C^u -3 +1 , C u ^-3 +2 : 

then it is equivalent to the maximal one it admits (see Theorems 112.21 and I12.4p . Let us 
assume B admits none of the three automata above. Let us also assume that i < k. 

1. If some Ai contains a (0,l)-flower and some Aj contains a (l,2)-flower, then B = 
(F M ®A)A F (li2) A F (0 ,i). It is easy to show that (F M © A) A F (1>2) A F (0jl) < B (c. f. 
Lemma l9.ip . We shall concentrate on the converse inequality. From the hypothesis that 
B does not admit C^-3 +1 , it follows easily that k must be odd and A L , . . . ,A K -i must be 
(1,2) automata. Furthermore, since B does not admit C w ws, A^ uses only ranks 1,2,3. 
The strategy for Duplicator in G(B, (Fu K -\ © A) A -^(1,2) A ^(0,1)) is analogous to the one 
used in the proof of the previous proposition. In the component Fu K \ © A simply mimic 
the behaviour of Spoiler's critical token. In ^(0,1) > loop around the 1-loop whenever Spoiler 
loops around the 1-loop of a (0, l)-flower in A K (again, if the run is to be accepting, this may 
happen only finitely many times), otherwise loop around 0-loop. For the strategy in ^(1,2); 
treat all the ranks appearing in Spoiler's Fu K \ or A as 2's, and the 3's in A K as l's. Seen 
this way, B is a (1, 2)-automaton, and by Theorem 112.21 Spoiler's actions can be simulated 
in F( lj2 ). 
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2. If Ai contain only (1,2) -flowers, then B = (F( iyK ) © A) A ^(1,2) ■ This is proved just like 
the first case. 

3. If Ai contain only (0, l)-flowers, then B = (A — > A t , . . . , A K -i, C\) AF( 0) i) fuse case ^ or 
5 to gei a canonical form) . Like in the first case, k must be odd, A L , . . . , A K _\ must be (1, 2)- 
automata. Consequently, it must be A K that contains a (0, l)-flower. Since A K contain no 

^(0,2) (by the hypothesis no Ai does), A K = -F( 01) . Again B > ((A -U . . .,A K -i, C\) A 

F( 01 )) is easy. The strategy for Duplicator in G(B, (A A L , . . . , Ci) A ^(0,1)) is to 

copy Spoiler's actions in A A t , . . . , A K -\,C\ and in -F(o,i) keep record of all l's appearing 
in A K (if the run is to be accepting, there may be only finitely many altogether). 

4- If Ai contain no non-trivial flowers, 1 = 0, and A L contains a D2, then B = (Fu K \ ®A) A 
^(1,2) ■ The inequality B < {Fu K -\ © A) A ^(1,2) is proved just like in the first case. Let us 
see that the converse holds. Consider the game G((F^ ^ © A) AF^^B) and the following 
strategy for Duplicator. Copy Spoiler's actions in Ft hK \ © A, but whenever Spoiler enters 
the 1-loop in (1, 2), loop once around 0-loop, move the extra token to the head loop of D2, 
and keep looping around until Spoiler leaves his 1-loop. Then remove your extra token, and 
so on. It is easy to see that the strategy is winning for Duplicator. 

5. If Ai contain no non-trivial flowers and either 1 7^ or A L contains no D2, then B = 
Fu,k) © A. To prove it, we have to describe the strategy for Duplicator in G(B, Fu K -\ © A). 
During the whole play keep numbering the new tokens produced by Spoiler according to 
their birth time. (As usual, the left token is considered a parent, the right token is born, 
transitions of the form p — > T, q or p — > q, T do not produce new tokens.) The strategy 
is as follows. While there are no new tokens in rejecting loops in A L , . . . , A K , keep copying 
Spoiler's moves in his Fu K )- When the first new token, say x^, enters a 1-loop, start looping 
around the 1-loop of your Fu K \ (the loop exists since 1 < ft), and keep doing it until Xi L 
leaves the 1-loop. If it does not happen, Spoiler will lose. When it does happen, stop 
looping around 1-loop. Investigate all the ranks used by Spoiler in (l, K)-flower while you 
were simulating Xi L , choose the highest one, say k, and loop once a fc-loop. Afterwords, if 
there are no tokens in rejecting loops in Ai, copy Spoiler's moves. Otherwise, choose the 
token with the smallest number, say Xi 2 , start looping around the loop with the highest 
rank 1 in your (i, /t)-flower, and so on. 

Let us see that if Spoiler does not enter A, he loses the game. If the run constructed 
by Spoiler is to be rejecting, either the highest rank used infinitely often in F^ >K ^ is odd, or 
some token stays forever in a rejecting loop in one of A L , . . . , A K . In any case Duplicator's 
strategy guarantees a rejecting run for him as well. Let us suppose that Spoiler's run is 
accepting. If only finitely many new tokens entered rejecting loops in A t , . . . , A K , then there 
was a round such that from this round on Duplicator was simply mimicking Spoiler's actions 
in Fu K \ and so Duplicator's run is also accepting. Suppose that infinitely many new tokens 
visited rejecting loops in A t , . . . , A K . We have assumed that either 1 7^ or A L contains no 
L>2- In either case the ranks greater then must have been used infinitely many times in 
F(l, k )- Consequently, the highest rank used in F^^ is greater then 1, and Duplicator's run 
is accepting despite infinitely many l's used in Fu K y 

Suppose now that Spoiler leaves F^ K y Following the argument used in the proof of the 
closure by ffi, we may suppose that the simple automaton containing the head loop of A is at 
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least a (l, k). When Spoiler enters A, he may produce no more tokens in A L , . . . , A K . From 
now on Duplicator should mimic Spoiler's behaviour in his copy of A, handling rejecting 
loops in A L , . . . , A K in the usual way. 

What is left is the case i = n. If k is odd, B = A — > A± . If k is even, Aq must be a (1, 2)- 
automaton. In the cases 2 and 4 proceed just like before. In the case 5, the automaton 
A cannot contain D 2 . If A € {C\, Z?i, C 2 }, then B = (Ci © A) A A . If A Q = Ex, then 
B = Ci®{AvCi). □ 

The following corollary sums up the closure results. 

Corollary 13.7. The class of canonical automata C is closed by V, ©, A, — L - >, and the 
equivalent automaton can be found in polynomial time. 

Proof. The claim is an almost immediate consequence of the preceding propositions. Only 
the automata C w w8, C w w3 +1 , C w w3 +2 need special care: if the result of the operation 
in question admits any of these automata, it is equivalent to the hardest one it admits 
(Theorems TT2\ and gZgD . □ 



14. Completeness 

In this section we show that the canonical automata represent the =w-classes of all 
deterministically recognisable tree languages. We will implicitly use Corollary 113.71 and the 
Substitution Lemma (Lemma 15.51 page [T2"j) on several occasions. 

We will say that a transition is positive if one of its branches lies on an accepting loop, 
and negative if one of its branches lies on a rejecting loop. Note that a transition may be 
positive and negative at the same time. Recall the notion of replication (see page [7]). We 
say that a state is j-replicated if it is replicated by a j-loop. An automaton is j-replicated 
if its initial state is j-replicated. 

Finally, let us recall the lifting operation invented by Niwihski and Walukiewicz and 
used to prove the decidability of the deterministic index hierarchy (Theorem 12.11 page[4|). 

Lemma 14.1 (Niwihski and Walukiewicz [7J). For each deterministic automaton A one can 
compute (in polynomial time if the productive states are given) an automaton A I ! 1 ... | n 
such that L(A) = L(A |°f 1 . . . j n ) and if a state q has the rank j < n than q lies on a j-loop 
of a (j,n) -flower. □ 

Theorem 14.2. For every deterministic tree automaton there exists an equivalent canonical 
automaton. 

Proof. Let A be a deterministic tree automaton. From Theorem 112.21 (7) it follows that if 
A admits C^ 3 +2 , A = C^-3 +2 . If A does not admit C^ 3 +2 , then by Theorem ll2.2l (5 and 
6) if A admits C^ 3 +1 , A = C^ 3 +1 . Otherwise L(A) € A3 and if A admits C^u-s, then 
A = C w u;-3 (Theorem 112. 4p . In the remaining of the proof we will assume that A admits 
none of these three automata. We will proceed by induction on the height of the DAG of 
SCCs of A. Let X denote the root SCC of A. We will say that X contains a transition 
p — ► p' ,p" , if X contains all three states, p, p' , and p" . We consider four separate cases. 
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1. X contains a positive transition. Observe that each state of A is replicated by an accept- 
ing loop. Therefore, if A admits -F(o,i)i it must also admit C\ ^> ^(0,1) = Ctj w - 3 +l> which is 
excluded by our initial assumption. Consequently, A is a (1, 2)-automaton (Theorem I2.2j) . 
Without loss of generality we may assume that A uses only ranks 1 and 2. 

By Theorem 112.21 (3 and 4), L(i ? ( lj2 )) is il^-complete and L(A) € Ilg, which implies 

that A < If A admits D\ ffi C\, then it also admits C\ ^—l T)\ © C\, and so 

A > d Dx © Ci. From Theorem MM (3) it follows that C x ^ £>i Ci = F (li2) . 
Consequently, ^4 = ^(1,2)- 

Suppose that A does not admit D\ ©Ci, but X contains a rejecting loop Ai- Let pi be 
a state on that loop. Since X contains a positive transition, it must contain an accepting 
loop and, in particular, a state with rank 2, say p%. Since X is strongly connected, we may 
find a loop A2 going from p\ to p\ via p 2 - Since X only uses ranks 1 and 2, and rank(p2) = 2, 
A2 is accepting. Hence, Ai and A2 form a (1, 2)-flower. In consequence, A > -^(1,2)- Hence, 

A = F (l,2)- 

Finally, suppose that X contains no rejecting loops and A does not admit D\ © C\. By 
Theorem (2), L(A) G n? and since L(C X © D{) is n?-complete, A < C x © Di. If A 
admits D\ it also admits C\ © Z?i, and so A = C2. If vl does not admit D\ it means that 
it contains no rejecting loop. Hence, A accepts every tree and A = C\. 

2. X contains an accepting loop and a negative transition, but no positive transitions. Let 
A + be an accepting loop in X and Ax be a loop visiting all X's nodes and containing 
a branch of the (negative) transition contained in X. Since X does not contain positive 
transitions, Ax is rejecting. The loops A+ and Ax form a (0, l)-flower. Hence, A admits 
-F(o,i)- Furthermore, should A contain a (0, 2) -flower, it would obviously be replicated by 
Ax and A would admit C\ — > ^(0,2) = C^s, which contradicts our general hypothesis. 
Hence, A does not admit -F(o,2)> which means A is a (1, 3)-automaton (Theorem 12.21 page 
[5]). Without loss of generality we may assume that it uses only ranks 1, 2, 3. 

By Theorem 112.21 (3 and 4), if A admits neither i 7 ^^) nor C\ ^—l D\ © C±, then 
A = ^(0,1) • Suppose that A admits one of these two automata. Consider the game G(F( 0j i) A 
F( 12 ),^4). Let x 1 and x 2 be Spoiler's tokens in -^(0,1) and ^(1,2)1 respectively. Since X 
contains a (negative) transition, Duplicator can split his critical token into y 1 and y 2 within 
X, and move y 1 to the (0, l)-flower in X, and y 2 to the (1, 2)-fiower, or to the accepting loop 

replicating a weak (1, 2)-fiower (if A admits C\ ^—X D\®C\). Then y 1 should mimic x 1 , and 

y 2 should mimic x 2 — either directly, or adapting the strategy from G(Fn2\ , C\ ^> D\ © 
Ci). Hence, Duplicator has a strategy to win the game. It follows that -^(0,1) A ^(1,2) — A. 
For the converse inequality, let us call the states with rank 3 contained in a (0, l)-flower 

red, and the remaining blue. Since A does not admit C\ ^> ^(0,1)) 110 re d state is replicated 
by an accepting loop. Consider the game G(A,F( ^ A jF(i 2))- For a strategy in -^(1,2) 
Duplicator should treat all the red states as if they had rank 1; the automaton A modified 
this way does not admit i^o.i)) so Duplicator may use the strategy given by Theorem 112.21 
(3 and 4). In -^(0,1) Duplicator should loop a 1-loop whenever some Spoiler's token is in 
a red state. Otherwise, Duplicator should loop a 0-loop. Let us see that this strategy is 
winning. 
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Suppose that Spoiler's run is accepting. After changing the ranks of red states from 3 
to 1 it is still accepting, so Duplicator's token in i^i^) visited an accepting path. By the 
Replication Lemma (Lemma 19.11 page [2T]) . the occurrences of red states in Spoiler's run 
may be covered by a finite number of paths. Furthermore, each of these paths is accepting, 
so it may only contain a finite number of red states. Hence, there may be only finitely 
many red states in Spoiler's run and the path visited by Duplicator's token in i^o.i) is also 
accepting. 

Suppose now, that Spoiler's run is rejecting. If red states occurred only finitely often, 
Spoiler's run is still rejecting after changing their ranks to 1, so Duplicator's token in F^^) 
visited a rejecting path. If there were infinitely many red states in Spoiler's run, Duplicator's 
token in -F(o,i) visited a rejecting path. 

Hence, A = i*(o,i) A ^(1,2) and by Lemma [972], A = Fn$\. 

3. X contains some transitions but no accepting loops. Let — ^ q'^q", i = l,...,n be 

all the transitions such that qi G X and q[, q*l £ X. Let pj ^* pj j = 1, . . . , m be all the 
remaining transitions such that pj £ X and p'j £ X. We will call the automata (A) q i, {A) q >/ 
and (A) p i, the child automata of X. By the induction hypothesis we may assume that they 
are in canonical forms. Let B = ((A)g> A (A) q ») V ... V {{A) q , n A {A) q ») V (A) p i V ... V (A)p> m . 
It is not difficult to see that A is equivalent to C\ —* B. 

4- X contains no transitions. Recall that this means exactly that at most one branch of 
every transition stays in X. First replace subtrees rooted in the target states of transitions 
whose all branches leave X with one canonical automaton B just like above. Let (t, n) denote 
the highest index of a flower contained in X. It is well defined, because a strongly connected 
component admitting jP(oj) and Fnj +1 ) must also admit i^oj+i)- We may assume that X 
uses only ranks l, . . . ,k, and that each j-loop is indeed a j-loop in a (j, ft)-flower (Lemma 
114. 1|) . For each j = 1, . . . , k, let Bj be the alternative of all the child automata replicated 

by a j-loop in X. By induction hypothesis, we may assume that B L , ...,B K and B are 

(i,k) 

canonical automata. Let A' = B — — > B b , . . . , B K . We will show that A = A' . 

If 1 = k, the assertion is clear. Suppose that 1 < k. Obviously, A' > A. Let us see that 
A' < A. Let A" denote the result of the following simplifications performed on A' . 

• If some Bi contains a (0, l)-flower and some Bj contains a (1, 2)-flower, replace B K with 
a (1, 3)-flower. 

• If some Bi contains a (0, l)-flower and no Bj contains a (1, 2)-flower, replace B R with 
a (0, l)-flower. 

• If some Bi contains a (l,2)-flower and no Bj contains a (0, 1) -flower, replace B R with 
a (1, 2)-flower. 

• If B L , . . . , B K admit no Fu K \ with 1 < k, remove B K . 

• If l = and B L admits D2, replace B L with D<i- Otherwise, remove B L . 

• Remove all B l+ \, . . . , B re _i. 

Examination of the five cases considered in the proof of Proposition 113.61 reveals that A' 
and A" have identical canonical forms. Consequently, A 1 = A", and it is enough to show 
that A" < A. Consider all (t, «;)-flowers in X. Choose any one whose t-loop replicates D2, 
if there is one, or take any (i, K)-flower otherwise. Then, extend the K-loop to a loop using 
all the transitions in X. Denote this flower, together with the subtrees replicated by i-loop 
or K-loop, by F. One can prove easily that A" < F © B, and obviously F © B < A. Q 
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Algorithm 1 The canonical form of deterministic tree automata 



1: if A admits C w w3 +2 then 

2: return C^-3 +2 

3: else if A admits C w w3 +1 then 

4: return C^ws+i 

5: else if ^4 admits then 

6: return C^, 

7: else 

8: X := the root SCC of A 

9: if X contains a positive transition then 

10: if A admits F( lj2 ) or ,4 admits D 2 then 

11: return ^(1,2) 

12: else if ^4 admits D\ then 

13: return C 2 

14: else 

15: return Ci 

16: end if 

17: else if X contains a negative transition then 

18: if X admits C\ then 

19: if A admits ^(1,2) or A admits ^> D 2 then 

20: return ^(1,3) 

21: else 

22: return -F(o,i) 

23: end if 

24: else 

25: i? := the alternative of the canonical forms of X's children 

26: return Ci —> B 

27: end if 

28: else {X contains no transitions} 

29: B := the alternative of the canonical forms of X's non-replicated children 

30: lift ranks in X 

31: (i, k) := the index of the maximal flower 

32: for j '.= l to k do 

33: Bj := the alternative of the canonical forms of X's j-replicated children 

34: end for 

(l k) 

35: return B B t ,...,B K 

36: end if 

37: end if 



From the proof of the Completeness Theorem one easily extracts an algorithm to cal- 
culate the canonical form of a given deterministic automaton (Algorithm 1). 

Corollary 14.3. For a deterministic tree automaton, a Wadge equivalent canonical au- 
tomaton can be calculated within the time of finding the productive states of the automaton. 

Proof. It is easy to see that the size of the canonical forms returned by the recursive calls of 
each depth is bounded by the size of A (up to a uniform constant factor). To prove the time 
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complexity of the algorithm assume that the productive states of A are given. Checking if 
A admits any of the automata mentioned in the algorithm can be easily done in polynomial 
time. The operations on the automata returned by the recursive calls of the procedure 
(lines 25, 26, 29, 33, and 35) are polynomial in the size of those automata, and by the 
initial remark also in the size of the automaton. By Lemma [14. li the lifting operation is also 
polynomial. Therefore, when implemented dynamically, this procedure takes polynomial 
time for each SCC. Processing the entire automaton increases this polynomial by a linear 
factor. □ 

Instead of a canonical automaton, the algorithm above can return its "name", i. e., 
a letter C, D, or E, and an ordinal a < w w ' 3 + 2 presented as a polynomial in lu w , with 
the coefficients presented as polynomials in u. Since for such presentation it is decidable in 
linear time if a < /?, as an immediate consequence of Corollary 114.31 and Theorem 110.51 we 
get an algorithm for Wadge reducibility. 

Corollary 14.4. For deterministic tree automata A, B it is decidable if L{A) <w L(B) 
(within the time of finding the productive states of the automata). □ 
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